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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: Artavanis-Tsakonas, S. et al. 

(i'l) TITLE OF INVENTIONS Therapeutic And Diagnostic Methods 
And Compositions Based On Notch Proteins And 
Nucleic Ac ids! f ill 

(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/083,590 

(B) FILING DATE: 25-JUN-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Misrock, S. Leslie 

(B) REGISTRATION NUMBER: 18,872 

(C) REFERENCE/DOCKET NUMBER: 7326-015 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212 790-9090 

(B) TELEFAX: 212 8698864/9741 

(C) TELEX: 66141 PENNIE 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2892 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

( ii ) MOLECULE TYPE : cDNA 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 142.. 2640 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GAATTCGGAG GAATTATTCA AAACATAAAC ACAATAAACA ATTTGAGTAG TTGCCGCACA 
CACACACACA CACAGCCCGT GGATTATTAC ACT AAAAG CG ACACTCAATC CAAAAAATCA 



GCAACAAAAA CAT C AAT AAA C ATG CAT TGG ATT AAA TGT TTA TTA AC A GCA 



• 



-90- 

Met His Trp lie Lys Cys Leu Leu Thr Ala 
1 5 10 

TTC ATT TGG TTC ACA GTC ATC GTG CAG GTT CAC AGT TCC GGC AGC TTT 219 
Phe lie Cys Phe Thr Val lie Val Gin Val His Ser Ser Gly Ser Phe 

15 20 25 

GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 267 
Glu Le'u Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arg Asp Asn Glu 

30 35 40 

GGT CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 315 

Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 

45 50 55 

GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CTA AAG CAC TAC CAG GCC 363 
Gly Ser Cys Lys Thr Arg Phe Arg Val Cys Leu Lys His Tyr Gin Ala 
60 65 70 

ACC ATC GAC ACC ACC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 411 
Thr lie Asp Thr Thr Ser Gin Cys Thr Tyr Gly Asp Val lie Thr Pro 
75 80 85 90 

ATT CTC GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC CAG CGC TTC CAG 459 
lie Leu Gly Glu Asn Ser Val Asn Leu Thr Asp Ala Gin Arg Phe Gin 

95 100 105 

AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 
Asn Lys Gly Phe Thr Asn Pro lie Gin Phe Pro Phe Ser Phe Ser Trp 

110 115 120 

CCG GGT ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 555 
Pro Gly Thr Phe Ser Leu lie Val Glu Ala Trp His Asp Thr Asn Asn 
125 130 135 

AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC ATC CAG CGA CTC TTG GTG 603 
Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu lie Gin Arg Leu Leu Val 
140 145 150 

CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 651 
Gin Gin Val Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu 
155 160 165 170 

TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 699 
Ser Gin Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu 

175 180 185 

AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 747 
Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp 

190 195 200 

TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 795 
Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu lie lie Cys Leu 
205 210 215 

ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 843 
Thr Gly Trp Gin Gly Asp Tyr Cys His lie Pro Lys Cys Ala Lys Gly 
220 225 230 

TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 891 
Cys Glu His Gly His Cys Asp Lys Pro Asn Gin Cys Val Cys Gin Leu 
235 240 245 250 

GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 939 
Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 

255 260 265 
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ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGG AAC GAG GGT 987 
lie His Gly Thr Cys Asn Lys Pro Trp Thr Cys lie Cys Asn Glu Gly 

270 275 280 

TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 1035 
Trp Gly Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn His 
285 290 295 

AGA CCfc TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 1083 
Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 
300 305 310 

TAC AGA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 1131 
Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 
315 320 325 330 

GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 1179 
Glu lie Tyr Ser Cys Asp Ala Asp Val Asn Pro Cys Gin Asn Gly Gly 

335 340 345 

ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AAG TGT CAT TGC 1227 
Thr Cys lie Asp Glu Pro His Thr Lys Thr Gly Tyr Lys Cys His Cys 

350 355 360 

GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACG TGT 1275 
Ala Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Val Leu Thr Cys 
365 370 375 

TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 1323 
Ser Asp Lys Pro Cys His Gin Gly lie Cys Arg Asn Val Arg Pro Gly 
380 385 390 

TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 1371 
Leu Gly Ser Lys Gly Gin Gly Tyr Gin Cys Glu Cys Pro lie Gly Tyr 
395 400 405 410 

AGC GGA CCC AAC TGC GAT CTC CAG CTG GAC AAC TGC AGT CCG AAT CCA 1419 
Ser Gly Pro Asn Cys Asp Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro 

415 420 425 

TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 1467 
Cys lie Asn Gly Gly Ser Cys Gin Pro Ser Gly Lys Cys lie Cys Pro 

430 435 440 

GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 1515 
Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr Asn lie Asp Asp Cys Leu 
445 450 455 

GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 1563 
Gly His Gin Cys Glu Asn Gly Gly Thr Cys lie Asp Met Val Asn Gin 
460 465 470 

TAT CGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 1611 
Tyr Arg Cys Gin Cys Val Pro Gly Phe His Gly Thr His Cys Ser Ser 
475 480 485 490 

AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 1659 
Lys Vari Asp Leu Cys Leu lie Arg Pro Cys Ala Asn Gly Gly Thr Cys 

495 500 505 

TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 1707 
Leu Asn Leu Asn Asn Asp Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr 

510 515 520 

GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 1755 
Gly Lys Asp Cys Ser Val Asp lie Asp Glu Cys Ser Ser Gly Pro Cys 
525 530 535 
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CAT AAC GGC GGC ACT TGC ATG AAC CGC GTC AAT TCG TTC GAA TGC GTG 1803 
His Asn Gly Gly Thr Cys Met Asn Arg Val Asn Ser Phe Glu Cys Val 
540 545 550 

TGT GCC AAT GGT TTC AGG GGC AAG CAG TGC GAT GAG GAG TCC TAC GAT 1851 
Cys Ala Asn Gly Phe Arg Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp 
555 560 565 570 

TCG GTfc ACC TTC GAT GCC CAC CAA TAT GGA GCG ACC ACA CAA GCG AGA 1899 
Ser Val Thr Phe Asp Ala His Gin Tyr Gly Ala Thr Thr Gin Ala Arg 

575 580 585 

GCC GAT GGT TTG ACC AAT GCC CAG GTA GTC CTA ATT GCT GTT TTC TCC 1947 
Ala Asp Gly Leu Thr Asn Ala Gin Val Val Leu lie Ala Val Phe Ser 

590 595 600 

GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 1995 
Val Ala Met Pro Leu Val Ala Val lie Ala Ala Cys Val Val Phe Cys 
605 610 615 

ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 2043 
Met Lys Arg Lys Arg Lys Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala 
620 625 630 

AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 2091 
Arg Lys Gin Asn Glu Gin Asn Ala Val Ala Thr Met His His Asn Gly 
635 640 645 650 

AGT GGG GTG GGT GTA GCT TTG GCT TCA GCC TCT CTG GGC GGC AAA ACT 2139 
Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr 

655 660 665 

GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 2187 
Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn lie lie 

670 675 680 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA GCA GCA 2235 
Lys Asn Thr Trp Asp Lys Ser Val Asn Asn lie Cys Ala Ser Ala Ala 
685 690 695 

GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 2283 
Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly 
700 705 710 

GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA GAC TTT 2331 
Gly Tyr Val Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe 
715 720 725 730 

TGT GTG GCT CCG CTA CAA AGA GCC AAG TCG CAA AAG CAA CTC AAC ACC 2379 
Cys Val Ala Pro Leu Gin Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr 

735 740 745 

GAT CCC ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427 
Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys 

750 755 760 

GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCT 2475 
Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg lie Ser 
765 770 775 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGT TGG CCC TCG TTG GCG 2523 
Val Leu Gly Glu Gly Ser Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala 
780 785 790 



GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 
Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gin Leu Met Ala Ala Ala 
795 800 805 810 



2571 
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TCG GCA GCG GGC AGO GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 
Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gin Gin Gin Arg Ser Val 

815 820 825 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 2670 
Val Cys Gly Thr Pro His Met 

830 

AAATCCGGAG AAATCCGCAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 2730 

GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 2790 

ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 2850 

TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 2892 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met His Trp lie Lys Cys Leu Leu Thr Ala Phe lie Cys Phe Thr Val 
15 10 15 

lie Val Gin Val His Ser Ser Gly Ser Phe Glu Leu Arg Leu Lys Tyr 

20 25 30 

Phe Ser Asn Asp His Gly Arg Asp Asn Glu Gly Arg Cys Cys Ser Gly 

35 40 45 

Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu Gly Ser Cys Lys Thr Arg 
50 55 60 

Phe Arg Val Cys Leu Lys His Tyr Gin Ala Thr lie Asp Thr Thr Ser 
65 70 75 80 

Gin Cys Thr Tyr Gly Asp Val lie Thr Pro lie Leu Gly Glu Asn Ser 

85 90 95 

Val Asn Leu Thr Asp Ala Gin Arg Phe Gin Asn Lys Gly Phe Thr Asn 

100 105 110 

Pro lie Gin Phe Pro Phe Ser Phe Ser Trp Pro Gly Thr Phe Ser Leu 
115 120 125 

lie Val Glu Ala Trp His Asp Thr Asn Asn Ser Gly Asn Ala Arg Thr 
130 135 140 

Asn Lys Leu Leu lie Gin Arg Leu Leu Val Gin Gin Val Leu Glu Val 
145 150 155 160 

Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu Ser Gin Tyr Thr Ser Leu 

165 170 175 

Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu Asn Tyr Tyr Gly Ser Gly 

180 185 190 

Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp Ser Phe Gly His Ser Thr 
195 200 205 
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Cys Ser Glu Thr Gly Glu lie lie Cys Leu Thr Gly Trp Gin Gly Asp 
210 215 220 

Tyr Cys His lie Pro Lys Cys Ala Lys Gly Cys Glu His Gly His Cys 
225 230 235 240 

Asp Lys Pro Asn Gin Cys Val Cys Gin Leu Gly Trp Lys Gly Ala Leu 

245 250 255 

Cys Asn Glu Cys Val Leu Glu Pro Asn Cys lie His Gly Thr Cys Asn 

260 265 270 

Lys Pro Trp Thr Cys He Cys Asn Glu Gly Trp Gly Gly Leu Tvr Cvs 
275 280 285 

Asn Gin Asp Leu Asn Tyr Cys Thr Asn His Arg Pro Cys Lys Asn Gly 
290 295 300 

Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu Tyr Thr Cys Lys Cys Ala 
305 310 315 320 

Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn Glu He Tyr Ser Cys Asp 

325 330 335 

Ala Asp Val Asn Pro Cys Gin Asn Gly Gly Thr Cys He Asp Glu Pro 

340 345 350 

His Thr Lys Thr Gly Tyr Lys Cys His Cys Ala Asn Gly Trp Ser Gly 
355 360 365 

Lys Met Cys Glu Glu Lys Val Leu Thr Cys Ser Asp Lys Pro Cys His 
370 375 380 

Gin Gly He Cys Arg Asn Val Arg Pro Gly Leu Gly Ser Lys Gly Gin 
385 390 395 400 

Gly Tyr Gin Cys Glu Cys Pro He Gly Tyr Ser Gly Pro Asn Cys Asp 

405 410 415 

Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro Cys He Asn Gly Gly Ser 

420 425 430 

Cys Gin Pro Ser Gly Lys Cys He Cys Pro Ala Gly Phe Ser Gly Thr 
435 440 445 

Arg Cys Glu Thr Asn He Asp Asp Cys Leu Gly His Gin Cys Glu Asn 
450 455 460 

Gly Gly Thr Cys He Asp Met Val Asn Gin Tyr Arg Cys Gin Cys Val 
465 470 475 480 

Pro Gly Phe His Gly Thr His Cys Ser Ser Lys Val Asp Leu Cys Leu 

485 490 495 

He Arg Pro Cys Ala Asn Gl£ Gly Thr Cys Leu Asn Leu Asn Asn Asp 

500 505 510 

Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr Gly Lys Asp Cys Ser Val 
515 520 525 

Asp He Asp Glu Cys Ser Ser Gly Pro Cys His Asn Gly Gly Thr Cys 
530 535 540 

Met Asn Arg Val Asn Ser Phe Glu Cys Val Cys Ala Asn Gly Phe Arg 
545 550 555 560 

Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp Ser Val Thr Phe Asp Ala 



1 



His Gin Tyr Gly 

580 

Ala Gin Val Val 
595 

Ala Val lie Ala 
610 



Arg Ala Gin Glu 
625 

Asn Ala Val Ala 



Leu Ala Ser Ala 

660 

Thr Phe Asp Gly 
675 

Ser Val Asn Asn 
690 

Ala Ala Ala Asp 
705 

Ala Asp Asn Asn 



Arg Ala Lys Ser 

740 

Arg Gly Ser Pro 
755 

Pro Gly Ala Ala 
770 



Tyr Cys Ser Gin 
785 

Ala Cys Ser Ser 



Ala Gly Thr Ala 

820 

Met 



565 

Ala Thr Thr Gin 



Leu He Ala Val 

600 

Ala Cys Val Val 
615 



Lys Asp Asp Ala 
630 

Thr Met His His 
645 



Ser Leu Gly Gly 



Gly Asn Pro Asn 

680 

lie Cys Ala Ser 
695 

Glu Cys Leu Met 
710 



Asn Ala Asn Ser 
725 



Gin Lys Gin Leu 



Ala Gly Ser Ser 

760 

Glu Gly Lys Arg 
775 

Arg Trp Pro Ser 
790 



Gin Leu Met Ala 
805 



Gin Gin Gin Arg 



570 

Ala Arg Ala Asp 
585 

Phe Ser Val Ala 



Phe Cys Met Lys 

620 

Glu Ala Arg Lys 
635 

Asn Gly Ser Gly 
650 

Lys Thr Gly Ser 
665 

lie lie Lys Asn 



Ala Ala Ala Ala 

700 

Tyr Gly Gly Tyr 
715 

Asp Phe Cys Val 
730 

Asn Thr Asp Pro 
745 

Ala Lys Gly Ala 



lie Ser Val Leu 

780 

Leu Ala Ala Ala 
795 

Ala Ala Ser Ala 
810 



Ser Val Val Cys 
825 



Gly Leu Thr Asn 
590 

Met Pro Leu Val 
605 



Arg Lys Arg Lys 



Gin Asn Glu Gin 

640 

Val Gly Val Ala 
655 

Asn Ser Gly Leu 
670 

Thr Trp Asp Lys 
685 



Ala Ala Ala Ala 



Val Ala Ser Val 

720 

Ala Pro Leu Gin 
735 

Thr Leu Met His 
750 



Ser Gly Gly Gly 
765 

Gly Glu Gly Ser 



Gly Val Ala Gly 

800 

Ala Gly Ser Gly 
815 

Gly Thr Pro His 
830 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 442.. 1320 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CCGAGTCGAfi 






TUAGCCCCTT 


TTCTGTCAAC 


GCTAAAGATC 


60 


TACAAAACAT 


CAGCGCCTAT 


CAAGTGGAAG 


TGTCAAGTGT 


GAACAAAACA 


AAAACGAGAG 


120 


AAGCACATAC 


TAAGGTCCAT 


ATAAATAATA 


AATAATAATT 


GTGTGTGATA 


ACAACATTAT 


180 


CCAAACAAAA 


CCAAACAAAA 


CGAAGGCAAA 


GTGGAGAAAA 


TGATACAGCA 


TCCAGAGTAC 


240 


GGCCGTTATT 


CAGCTATCCA 


GAGCAAGTGT 


AGTGTGGCAA 


AATAGAAACA 


AACAAAGGCA 


300 


CCAAAATCTG 


CATACATGGG 


CTAATTAAGG 


CTGCCCAGCG 


AATTTACATT 


TGTGTGGTGC 


360 


CAATCCA6AG 


TGAATCCGAA 


ACAAACTCCA 


TCTAGATCGC 


CAACCAGCAT 


CACGCTCGCA 


420 


AACGCCCCCA 


GAATGTACAA 


A ATG TTT AGG AAA CAT 


TTT CGG CGA AAA CCA 


471 



Met Phe Arg Lys His Phe Arg Arg Lys Pro 
15 10 

GCT ACG TCG TCG TCG TTG GAG TCA AC A ATA GAA TCA GCA GAC AGC CTG 519 
Ala Thr Ser Ser Ser Leu Glu Ser Thr lie Glu Ser Ala Asp Ser Leu 

.15 20 25 

GGA ATG TCC AAG AAG ACG GCG ACA AAA AGG CAG CGT CCG AGG CAT CGG 567 
Gly Met Ser Lys Lys Thr Ala Thr Lys Arg Gin Arg Pro Arg His Arg 

30 35 40 

GTA CCC AAA ATC GCG ACC CTG CCA TCG ACG ATC CGC GAT TGT CGA TCA 615 
Val Pro Lys lie Ala Thr Leu Pro Ser Thr lie Arg Asp Cys Arg Ser 

45 50 55 

TTA AAG TCT GCC TGC AAC TTA ATT GCT TTA ATT TTA ATA CTG TTA GTC 663 
Leu Lys Ser Ala Cys Asn Leu lie Ala Leu lie Leu lie Leu Leu Val 
60 65 70 

CAT AAG ATA TCC GCA GCT GGT AAC TTC GAG CTG GAA ATA TTA GAA ATC 711 
His Lys lie Ser Ala Ala Gly Asn Phe Glu Leu Glu lie Leu Glu He 
75 80 85 90 

TCA AAT ACC AAC AGC CAT CTA CTC AAC GGC TAT TGC TGC GGC ATG CCA 759 
Ser Asn Thr Asn Ser His Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro 

95 100 105 

GCG GAA CTT AGG GCC ACC AAG ACG ATA GGC TGC TCG CCA TGC ACG ACG 807 
Ala Glu Leu Arg Ala Thr Lys Thr He Gly Cys Ser Pro Cys Thr Thr 

110 115 120 

GCA TTC CGG CTG TGC CTG AAG GAG TAG CAG ACC ACG GAG CAG GGT GCC 855 
Ala Phe Arg Leu Cys Leu Lys Glu Tyr Gin Thr Thr Glu Gin Gly Ala 
125 130 135 

AGC ATA TCC ACG GGC TGT TCG TTT GGC AAC GCC ACC ACC AAG ATA CTG 903 
Ser He Ser Thr Gly Cys Ser Phe Gly Asn Ala Thr Thr Lys lie Leu 
140 145 150 

GGT GGC TCC AGC TTT GTG CTC AGC GAT CCG GGT GTG GGA GCC ATT GTG 951 
Gly Gly Ser Ser Phe Val Leu Ser Asp Pro Gly Val Gly Ala He Val 
155 160 165 170 

CTG CCC TTT ACG TTT CGT TGG ACG AAG TCG TTT ACG CTG ATA CTG CAG 999 
Leu Pro Phe Thr Phe Arg Trp Thr Lys Ser Phe Thr Leu He Leu Gin 

175 180 185 

GCG TTG GAT ATG TAC AAC ACA TCC TAT CCA GAT GCG GAG AGG TTA ATT 1047 
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Ala Leu Asp Met Tyr Asn Thr Ser Tyr Pro Asp Ala Glu Arg Leu lie 

190 195 200 

GAG GAA ACA TCA TAC TCG GGC GTG ATA CTG CCG TCG CCG GAG TGG AAG 1095 
Glu Glu Thr Ser Tyr Ser Gly Val lie Leu Pro Ser Pro Glu Trp Lys 
205 210 215 

ACG CTG GAC CAC ATC GGG CGG AAC GCG CGG ATC ACC TAC CGT GTC CGG 1143 
Thr Leu Asp His lie Gly Arg Asn Ala Arg lie Thr Tyr Arg Val Arg 
220 225 230 

GTG CAA TGC GCC GTT ACC TAC TAC AAC ACG ACC TGC ACG ACC TTC TGC 1191 
Val Gin Cys Ala Val Thr Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys 
235 240 245 250 

CGT CCG CGG GAC GAT CAG TTC GGT CAC TAC GCC TGC GGC TCC GAG GGT 1239 
Arg Pro Arg Asp Asp Gin Phe Gly His Tyr Ala Cys Gly Ser Glu Gly 

255 260 265 

CAG AAG CTC TGC CTG AAT GGC TGG CAG GGC GTC AAC TGC GAG GAG GCC 1287 
Gin Lys Leu Cys Leu Asn Gly Trp Gin Gly Val Asn Cys Glu Glu Ala 

270 275 280 

ATA TGC AAG GCG GGC TGC GAC CCC GTC CAC GGC 1320 
lie Cys Lys Ala Gly Cys Asp Pro Val His Gly 
285 290 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 293 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Phe Arg Lys His Phe Arg Arg Lys Pro Ala Thr Ser Ser Ser Leu 
15 10 15 

Glu Ser Thr lie Glu Ser Ala Asp Ser Leu Gly Met Ser Lys Lys Thr 

20 25 30 

Ala Thr Lys Arg Gin Arg Pro Arg His Arg Val Pro Lys lie Ala Thr 

35 40 45 

Leu Pro Ser Thr lie Arg Asp Cys Arg Ser Leu Lys Ser Ala Cys Asn 
50 55 60 

Leu lie Ala Leu lie Leu lie Leu Leu Val His Lys lie Ser Ala Ala 
65 70 75 80 

Gly Asn Phe Glu Leu Glu lie Leu Glu lie Ser Asn Thr Asn Ser His 

85 90 95 

Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro Ala Glu Leu Arg Ala Thr 

100 105 110 

Lys Thr lie Gly Cys Ser Pro Cys Thr Thr Ala Phe Arg Leu Cys Leu 
115 120 125 

Lys Glu Tyr Gin Thr Thr Glu Gin Gly Ala Ser lie Ser Thr Gly Cys 
130 135 140 

Ser Phe Gly Asn Ala Thr Thr Lys lie Leu Gly Gly Ser Ser Phe Val 
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145 150 155 160 

Leu Ser Asp Pro Gly Val Gly Ala lie Val Leu Pro Phe Thr Phe Arg 

165 170 175 

Trp Thr Lys Ser Phe Thr Leu lie Leu Gin Ala Leu Asp Met Tyr Asn 

180 185 190 

Thr Ser Tyr Pro Asp Ala Glu Arg Leu lie Glu Glu Thr Ser Tyr Ser 
195 200 205 

Gly Val lie Leu Pro Ser Pro Glu Trp Lys Thr Leu Asp His lie Gly 
210 215 220 

Arg Asn Ala Arg lie Thr Tyr Arg Val Arg Val Gin Cys Ala Val Thr 
225 230 235 240 

Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys Arg Pro Arg Asp Asp Gin 

245 250 255 

Phe Gly His Tyr Ala Cys Gly Ser Glu Gly Gin Lys Leu Cys Leu Asn 

260 265 270 

Gly Trp Gin Gly Val Asn Cys Glu Glu Ala lie Cys Lys Ala Gly Cys 
275 280 285 

Asp Pro Val His Gly 
290 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGGTGGACTT CCTTCGTGTA TTGGTGGGAG CCCTCGGGAA CGGGGGGTAA CACTGAAAGG 
TCGAGTACCC ATTTCCGTCA TAACGGGTTG GTCGCCCCCT AGGGGTCGGA GTCAGGTGGA 
CGGGAGGTCG ACAACGCCCG GGGGACGGGT GGTACATGGT GTAAGGTCTT TACCGGACCG 
GGCAAACGGG TCACACCGAA AGGGGTGAAC GGTAACTACG GGGTCGTCCT GCCCGTCCAT 
CGAGTCTGGT AAGAGGGTCG CCTTAAG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAATTCCTTC CATTATACGT GACTTTTCTG AAACTGTAGC CACCCTAGTG TCTCTAACTC 
CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 
GCGGGCATGC TCCAGTTTGG TCTGCGTCTC AAGATCACCT TTGGTAATTG ATTCTTCTTC 
AACCCGGAAC TGAAGGCTGG CTCTCACCCT CTAGGCAGAG CAGGAATTCC GAGGTGGATG 
TGTTAGATGT GAATGTCCGT GGCCCAGATG GCTGCACCCC ATTGATGTTG GCTTCTCTCC 
GAGGAGGCAG CTCAGATTTG AGTGATGAAG ATGAAGATGC AGAGGACTGT TCTGCTAACA 
TCATCACAGA CTTGGTCTAC CAGGGTGCCA GCCTCCAGNC CAGACAGACC GGACTGGTGA 
GATGGCCCTG CACCTTGCAG CCCGCTACTC ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 
TGCAGGTGCA GATGCCAATG CCCAGGACAA CATGGGCCGC TGTCCACTCC ATGCTGCAGT 
GGCACGTGAT GCCAAGGTGT ATTCAGATCT GTTA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 
CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 
GCCAAGCGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 
CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 
AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAATTCCATT CAGGAGGAAA GGGTGGGGAG AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 
ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 
TTCAGAAAAT TCCAGAAAAG CCCTACCCCA ACTCGGACGG CAACGTCACA CCCGTGGGTA 
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GCAACTGGCA CACAAACAGC CAGCGTGTCT GGGGCACGGG GGGATGGCAC CCCCTGCAGG 240 
CAGAGCTG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TACGTATCTC GAGCACAGAC AGCTGACGTA CACTTTTNNA GTGCGAGGGA CATTCGTCCG 60 

ACCAGTACGA ACATTTAGGC TCAGTACGGT AGGTCCATGG CCAAGACTAG GAGACGTAGG 120 

GAGCTACAGG TCCCGCTCGC TAAACTCGGA CCACTGAAAC CTCCGGTCGA CAGTCGGTAA 180 

GCGAACAAGA GGGCCAGATC TTAGAGAAGG TGTCGCGGCG AGACTCGGGC TCGGGTCAGG 240 

4 

CGGCCTTAAG GACGTCGGGC CCNNNAGGTG ATCAAGATCT CGNCNCGGCG GGCGCCACCT 300 
CGAGGNCGAA AACAAGGGAA ATC 323 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..3234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGC CAG GAG GAC GCG GGC AAC AAG GTC TGC AGC CTG CAG TGC AAC AAC 48 
Cys Gin Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn 
15 10 15 

CAC GCG TGC GGC TGG GAC GGC GGT GAC TGC TCC CTC AAC TTC AAT GAC 96 
His Ala Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp 

20 25 30 

CCC TGG AAG AAC TGC ACG CAG TCT CTG CAG TGC TGG AAG TAC TTC AGT 144 
Pro Trp Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser 

35 40 45 

GAC GGC CAC TGT GAC AGC CAG TGC AAC TCA GCC GGC TGC CTC TTC GAC 192 
Asp Gly His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp 
50 55 60 

GGC TTT GAC TGC CAG CGT GCG GAA GGC CAG TGC AAC CCC CTG TAC GAC 240 
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Gly Phe Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp 
65 70 75 80 

CAG TAC TGC AAG GAC CAC TTC AGC GAC GGG CAC TGC GAC CAG GGC TGC 288 
Gin Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys 

85 90 95 

AAC AGC GCG GAG TGC GAG TGG GAC GGG CTG GAC TGT GCG GAG CAT GTA 336 
Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 

100 105 110 

CCC GAG AGG CTG GCG GCC GGC ACG CTG GTG GTG GTG GTG CTG ATG CCG 384 
Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 
115 120 125 

CCG GAG CAG CTG CGC AAC AGC TCC TTC CAC TTC CTG CGG GAG CTC AGC 432 
Pro Glu Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser 
130 135 140 

CGC GTG CTG CAC ACC AAC GTG GTC TTC AAG CGT GAC GCA CAC GGC CAG 480 
Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin 
145 150 155 160 

CAG ATG ATC TTC CCC TAC TAC GGC CGC GAG GAG GAG CTG CGC AAG CAC 528 
Gin Met lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His 

165 170 175 

CCC ATC AAG CGT GCC GCC GAG GGC TGG GCC GCA CCT GAC GCC CTG CTG 576 
Pro lie Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu 

180 185 190 

GGC CAG GTG AAG GCC TCG CTG CTC CCT GGT GGC AGC GAG GGT GGG CGG 624 
Gly Gin Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg 
195 200 205 

CGG CGG AGG GAG CTG GAC CCC ATG GAC GTC CGC GGC TCC ATC GTC TAC 672 
Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser lie Val Tyr 
210 215 220 

CTG GAG ATT GAC AAC CGG CAG TGT GTG CAG GCC TCC TCG CAG TGC TTC 720 
Leu Glu lie Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe 
225 230 235 240 

CAG AGT GCC ACC GAC GTG GCC GCA TTC CTG GGA GCG CTC GCC TCG CTG 768 
Gin Ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu 

245 250 255 

GGC AGC CTC AAC ATC CCC TAC AAG ATC GAG GCC GTG CAG AGT GAG ACC 816 
Gly Ser Leu Asn lie Pro Tyr Lys lie Glu Ala Val Gin Ser Glu Thr 

260 265 270 

GTG GAG CCG CCC CCG CCG GCG CAG CTG CAC TTC ATG TAC GTG GCG GCG 864 
Val Glu Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala 
275 280 285 

GCC GCC TTT GTG CTT CTG TTC TTC GTG GGC TGC GGG GTG CTG CTG TCC 912 
Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 
290 295 300 

CGC AAG CGC CGG CGG CAG CAT GGC CAG CTC TGG TTC CCT GAG GGC TTC 960 
Arg Lys Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe 
305 310 315 320 

AAA GTG TCT GAG GCC AGC AAG AAG AAG CGG CGG GAG CCC CTC GGC GAG 1008 
Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 

325 330 335 
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GAC TCC GTG GGC CTC AAG CCC CTG AAG AAC GCT TCA GAC GGT GCC CTC 1056 
Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu 

340 345 350 

ATG GAC GAC AAC CAG AAT GAG TGG GGG GAC GAG GAC CTG GAG ACC AAG 1104 
Met Asp Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys 
355 360 365 

AAG TTt! CGG TTC GAG GAG CCC GTG GTT CTG CCT GAC CTG GAC GAC CAG 1152 
Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin 
370 375 380 

ACA GAC CAC CGG CAG TGG ACT CAG CAG CAC CTG GAT GCC GCT GAC CTG 1200 
Thr Asp His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu 
385 390 395 400 

CGC ATG TCT GCC ATG GCC CCC ACA CCG CCC CAG GGT GAG GTT GAC GCC 1248 
Arg Met Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala 

405 410 415 

GAC TGC ATG GAC GTC AAT GTC CGC GGG CCT GAT GGC TTC ACC CCG CTC 1296 
Asp Cys Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu 

420 425 430 

ATG ATC GCC TCC TGC AGC GGG GGC GGC CTG GAG ACG GGC AAC AGC GAG 1344 
Met lie Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu 
435 440 445 

GAA GAG GAG GAC GCG CCG GCC 6TC ATC TCC GAC TTC ATC TAC CAG GGC 1392 
Glu Glu Glu Asp Ala Pro Ala Val lie Ser Asp Phe lie Tyr Gin Gly 
450 455 460 

GCC AGC CTG CAC AAC CAG ACA GAC CGC ACG GGC GAG ACC GCC TTG CAC 1440 
Ala Ser Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His 
465 470 475 480 

CTG GCC GCC CGC TAC TCA CGC TCT GAT GCC GCC AAG CGC CTG CTG GAG 1488 
Leu Ala Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu 

485 490 495 

GCC AGC GCA GAT GCC AAC ATC CAG GAC AAC ATG GGC CGC ACC CCG CTG 1536 
Ala Ser Ala Asp Ala Asn lie Gin Asp Asn Met Gly Arg Thr Pro Leu 

500 505 510 

CAT GCG GCT GTG TCT GCC GAC GCA CAA GGT GTC TTC CAG ATC CTG ATC 1584 
His Ala Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin He Leu He 
515 520 525 

CGG AAC CGA GCC ACA GAC CTG GAT GCC CGC ATG CAT GAT GGC ACG ACG 1632 
Arg Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr 
530 535 540 

CCA CTG ATC CTG GCT GCC CGC CTG GCC GTG GAG GGC ATG CTG GAG GAC 1680 
Pro Leu lie Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp 
545 550 555 560 

CTC ATC AAC TCA CAC GCC GAC GTC AAC GCC GTA GAT GAC CTG GGC AAG 1728 
Leu He Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys 

565 570 575 

TCC GCC CTG CAC TGG GCC GCC GCC GTG AAC AAT GTG GAT GCC GCA GTT 1776 
Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val 

580 585 590 

GTG CTC CTG AAG AAC GGG GCT AAC AAA GAT ATG CAG AAC AAC AGG GAG 1824 
Val Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu 
595 600 605 
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GAG ACA CCC CTG TTT CTG GCC GCC CGG GAG GGC AGC TAC GAG ACC GCC 1872 
Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala 
610 615 620 

AAG GTG CTG CTG GAC CAC TTT GCC AAC CGG GAC ATC ACG GAT CAT ATG 1920 
Lys Val Leu Leu Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met 
625 630 635 640 

GAC CGC CTG CCG CGC GAC ATC GCA CAG GAG CGC ATG CAT CAC GAC ATC 1968 
Asp Arg Leu Pro Arg Asp lie Ala Gin Glu Arg Met His His Asp lie 

645 650 655 

GTG AGG CTG CTG GAC GAG TAC AAC CTG GTG CGC AGC CCG CAG CTG CAC 2016 
Val Arg Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His 

660 665 670 

GGA GCC CCG CTG GGG GGC ACG CCC ACC CTG TCG CCC CCG CTC TGC TCG 2064 
Gly Ala Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser 
675 680 685 

CCC AAC GGC TAC CTG GGC AGC CTC AAG CCC GGC GTG CAG GGC AAG AAG 2112 
Pro Asn Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys 
690 695 700 

GTC CGC AAG CCC AGC AGC AAA GGC CTG GCC TGT GGA AGC AAG GAG GCC 2160 
Val Arg Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala 
705 710 715 720 

AAG GAC CTC AAG GCA CGG AGG AAG AAG TCC CAG GAT GGC AAG GGC TGC 2208 
Lys Asp Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys 

725 730 735 

CTG CTG GAC AGC TCC GGC ATG CTC TCG CCC GTG GAC TCC CTG GAG TCA 2256 
Leu Leu Asp Ser Ser Gly Met: Leu Ser Pro Val Asp Ser Leu Glu Ser 

740 745 750 

CCC CAT GGC TAC CTG TCA GAC GTG GCC TCG CCG CCA CTG CTG CCC TCC 2304 
Pro His Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser 
755 760 765 

CCG TTC CAG CAG TCT CCG TCC GTG CCC CTC AAC CAC CTG CCT GGG ATG 2352 
Pro Phe Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met 
770 775 780 

CCC GAC ACC CAC CTG GGC ATC GGG CAC CTG AAC GTG GCG GCC AAG CCC 2400 
Pro Asp Thr His Leu Gly lie Gly His Leu Asn Val Ala Ala Lys Pro 
785 790 795 800 

GAG ATG GCG GCG CTG GGT GGG GGC GGC CGG CTG GCC TTT GAG ACT GGC 2448 
Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 

805 810 815 

CCA CCT CGT CTC TCC CAC CTG CCT GTG GCC TCT GGC ACC AGC ACC GTC 2496 
Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 

820 825 830 

CTG GGC TCC AGC AGC GGA GGG GCC CTG AAT TTC ACT GTG GGC GGG TCC 2 544 

Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 
835 840 845 

ACC AGT TTG AAT GGT CAA TGC GAG TGG CTG TCC CGG CTG CAG AGC GGC 2592 
Thr Ser Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly 
850 855 860 

ATG GTG CCG AAC CAA TAC AAC CCT CTG CGG GGG AGT GTG GCA CCA GGC 2640 
Met Val Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly 
865 870 875 880 
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CCC CTG AGC ACA CAG GCC CCC TCC CTG CAG CAT GGC ATG GTA GGC CCG 2 688 

Pro Leu Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro 

885 890 895 

CTG CAC AGT AGC CTT GCT GCC AGC GCC CTG TCC CAG ATG ATG AGC TAC 2736 
Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr 

900 905 910 

CAG GGC CTG CCC AGC ACC CGG CTG GCC ACC CAG CCT CAC CTG GTG CAG 2784 
Gin Gly Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin 
915 920 925 

ACC CAG CAG GTG CAG CCA CAA AAC TTA CAG ATG CAG CAG CAG AAC CTG 2832 
Thr Gin Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu 
930 935 940 

CAG CCA GCA AAC ATC CAG CAG CAG CAA AGC CTG CAG CCG CCA CCA CCA 2880 
Gin Pro Ala Asn lie Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro 
945 950 955 960 

CCA CCA CAG CCG CAC CTT GGC GTG AGC TCA GCA GCC AGC GGC CAC CTG 2928 
Pro Pro Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu 

965 970 975 

GGC CGG AGC TTC CTG AGT GGA GAG CCG AGC CAG GCA GAC GTG CAG CCA 2976 
Gly Arg Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro 

980 985 990 

CTG GGC CCC AGC AGC CTG GCG GTG CAC ACT ATT CTG CCC CAG GAG AGC 3024 
Leu Gly Pro Ser Ser Leu Ala Val His Thr lie Leu Pro Gin Glu Ser 
995 1000 1005 

CCC GCC CTG CCC ACG TCG CTG CCA TCC TCG CTG GTC CCA CCC GTG ACC 3072 
Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 
1010 1015 1020 

GCA GCC CAG TTC CTG ACG CCC CCC TCG CAG CAC AGC TAC TCC TCG CCT 3120 
Ala Ala Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro 
1025 1030 1035 1040 

GTG GAC AAC ACC CCC AGC CAC CAG CTA CAG GTG CCT GTT CCT GTA ATG 3168 
Val Asp Asn Thr Pro Ser His Gin Leu Gin Val Pro Val Pro Val Met 

1045 1050 1055 

GTA ATG ATC CGA TCT TCG GAT CCT TCT AAA GGC TCA TCA ATT TTG ATC 3216 
Val Met lie Arg Ser Ser Asp Pro Ser Lys Gly Ser Ser lie Leu lie 

1060 1065 1070 

GAA GCT CCC GAC TCA TGG 3234 
Glu Ala Pro Asp Ser Trp 
1075 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1078 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

Cys Gin Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn 
15 10 15 
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His Ala Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp 

20 25 30 

Pro Trp Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser 

35 40 45 

Asp Gly His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp 
50 55 60 

Gly Phe Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp 
65 70 75 80 

Gin Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys 

85 90 95 

Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 

100 105 110 

Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 
115 120 125 

Pro Glu Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser 
130 135 140 

Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin 
145 150 155 160 

Gin Met lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His 

165 170 175 

Pro lie Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu 

180 185 190 

Gly Gin Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg 
195 200 205 

Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser He Val Tyr 
210 215 220 

Leu Glu lie Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe 
225 230 235 240 

Gin Ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu 

245 250 255 

Gly Ser Leu Asn lie Pro Tyr Lys He Glu Ala Val Gin Ser Glu Thr 

260 265 270 

Val Glu Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala 
275 280 285 

Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 
290 295 300 

Arg Lys Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe 
305 310 315 320 

Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 

325 330 335 

Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu 

340 345 350 

Met Asp Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys 
355 360 365 

Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin 





370 

Thr Asp His Arg 
385 

Arg Met Ser Ala 



Asp ay's Met Asp 

420 



Met lie Ala Ser 
435 

Glu Glu Glu Asp 
450 

Ala Ser Leu His 
465 

Leu Ala Ala Arg 



Ala Ser Ala Asp 

500 



His Ala Ala Val 

515 . 

Arg Asn Arg Ala 
530 



Fro Leu lie Leu 
545 



Leu lie Asn Ser 



Ser Ala Leu His 

580 

Val Leu Leu Lys 
595 

Glu Thr Pro Leu 
610 



Lys Val Leu Leu 
625 

Asp Arg Leu Pro 



Val Arg Leu Leu 

660 



Gly Ala Pro Leu 
675 



Pro Asn Gly Tyr 
690 



Val Arg Lys Pro 
705 



Lys Asp Leu Lys 



375 

Gin Trp Thr Gin 
390 

Met Ala Pro Thr 
405 

Val Asn Val Arg 



Cys Ser Gly Gly 

440 

Ala Pro Ala Val 
455 

Asn Gin Thr Asp 
470 



Tyr Ser Arg Ser 
485 

Ala Asn lie Gin 



Ser Ala Asp Ala 

520 



Thr Asp Leu Asp 
535 



Ala Ala Arg Leu 
550 

His Ala Asp Val 
565 

Trp Ala Ala Ala 



Asn Gly Ala Asn 

600 



Phe Leu Ala Ala 
615 

Asp His Phe Ala 
630 

Arg Asp lie Ala 
645 



Asp Glu Tyr Asn 



Gly Gly Thr Pro 

680 

Leu Gly Ser Leu 
695 

Ser Ser Lys Gly 
710 



Ala Arg Arg Lys 
725 
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380 

Gin His Leu Asp 
395 

Pro Pro Gin Gly 
410 

Gly Pro Asp Gly 
425 

Gly Leu Glu Thr 



lie Ser Asp Phe 

460 

Arg Thr Gly Glu 
475 



Asp Ala Ala Lys 
490 

Asp Asn Met Gly 
505 

Gin Gly Val Phe 



Ala Arg Met His 

540 



Ala Val Glu Gly 
555 

Asn Ala Val Asp 
570 

Val Asn Asn Val 
585 

Lys Asp Met Gin 



Arg Glu Gly Ser 

620 



Asn Arg Asp lie 
635 

Gin Glu Arg Met 
650 

Leu Val Arg Ser 
665 

Thr Leu Ser Pro 



Lys Pro Gly Val 

700 



Leu Ala Cys Gly 
715 

Lys Ser Gin Asp 
730 



Ala Ala Asp Leu 

400 

Glu Val Asp Ala 
415 



Phe Thr Pro Leu 
430 

Gly Asn Ser Glu 
445 

lie Tyr Gin Gly 



Thr Ala Leu His 

480 



Arg Leu Leu Glu 
495 



Arg Thr Pro Leu 
510 

Gin lie Leu lie 
525 

Asp Gly Thr Thr 



Met Leu Glu Asp 

560 



Asp Leu Gly Lys 
575 

Asp Ala Ala Val 

59,0 

Asn Asn Arg Glu 
605 

Tyr Glu Thr Ala 



Thr Asp His Met 

640 



His His Asp lie 
655 

Pro Gin Leu His 
670 

Pro Leu Cys Ser 
685 

Gin Gly Lys Lys 



Ser Lys Glu Ala 

720 



Gly Lys Gly Cys 
735 
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Leu Leu Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser 

740 745 750 

Pro His Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser 
755 760 765 

Pro Phe Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met 
770 775 780 

Pro Asp Thr His Leu Gly lie Gly His Leu Asn Val Ala Ala Lys Pro 
785 790 795 800 

Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 

805 810 815 

Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 

820 825 830 

Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 
835 840 845 

Thr Ser Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly 
850 855 860 

Met Val Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly 
865 870 875 880 

Pro Leu Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro 

885 890 895 

Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr 

900 905 910 

Gin Gly Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin 
915 920 925 

Thr Gin Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu 
930 935 940 

Gin Pro Ala Asn lie Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro 
945 950 955 960 

Pro Pro Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu 

965 970 975 

Gly Arg Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro 

980 985 990 

Leu Gly Pro Ser Ser Leu Ala Val His Thr lie Leu Pro Gin Glu Ser 
995 1000 1005 

Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 
1010 1015 1020 

Ala Ala Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro 
1025 1030 1035 1040 

Val Asp Asn Thr Pro Ser His Gin Leu Gin Val Pro Val Pro Val Met 

1045 1050 1055 

Val Met lie Arg Ser Ser Asp Pro Ser Lys Gly Ser Ser lie Leu lie 

1060 1065 1070 

Glu Ala Pro Asp Ser Trp 
1075 

(2) INFORMATION FOR SEQ ID NO: 12: 



I 



-108- 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



* 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 1972 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 46 
Glu Val Aap Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys 
1 5 . 10 15 

ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 94 
Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser 

20 25 30 

GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 142 
Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn lie lie Thr Asp 

35 40 45 

TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 190 
Leu Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly 

50 55 60 

GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 238 
Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala 
65 70 75 

AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 286 
Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met 
80 85 90 95 

GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 334 
Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val 

100 105 110 

TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 382 
Phe Gin lie Leu lie Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met 

115 120 125 

AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 430 
Asn Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg Leu Ala Val Glu 
130 135 140 

GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT GCA GTG 478 
Gly Met Val Ala Glu Leu lie Asn Cys Gin Ala Asp Val Asn Ala Val 
145 150 155 

GAT GAC CAT GGA AAA TCT GCT CTT CAC TGG GCA GCT GCT GTC AAT AAT 526 
Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn 
160 165 - 170 175 

GTG GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 574 
Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met 

180 185 190 

CAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 62 2 

Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly 

195 200 205 



\ 
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AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 
Ser Tyr Glu Ala Ala Lys lie Leu Leu Asp His Phe Ala Asn Arg Asp 
210 215 220 

ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 
lie Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg 
225 230 235 

ATG CAC CAT GAC ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 
Met His His Asp lie Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro 
240 . 245 250 255 

AGC CCT CCA GGC ACC GTG TTG ACT TCT GCT CTC TCA CCT GTC ATC TGT 
Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val lie Cys 

260 265 270 

GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 
Gly Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys 

275 280 285 

AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 
Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro 
290 295 300 

AAC CTT GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 
Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys 
305 310 315 

TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 
Ser Leu Ser Glu Lys Val Gin Leu Ser Glu Ser Ser Val Thr Leu Ser 
320 325 330 335 

CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 
Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 

340 345 350 

TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 
Ser Ser Pro Met lie Thr Ser Pro Gly lie Leu Gin Ala Ser Pro Asn 

355 360 365 

CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 
Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gin His 
370 375 380 

GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 
Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly 
385 390 395 

GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 
Ala Ser Thr. Val Leu Pro Ser Val Ser Gin Leu Leu Ser His His His 
400 405 410 415 

ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 
lie Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His 

420 425 430 

CCA GTC CCA GTC CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 
Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu 

435 440 445 

ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 
Thr Gin Tyr Ash Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly 
450 455 460 

ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 
Thr His Pro Gly lie Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His 
465 470 475 
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ATA ACC ACC CCT CGG GAG CCC TTG CCC CCC ATT 6TG fflCCT TTC CAG CTC 1486 
lie Thr Thr Pro Arg Glu Pro Leu Pro Pro lie Va3L Tffiar Phe Gin Leu 
480 485 490 495 

ATC CCT AAA GGC AGT ATT GCC CAA CCA GCG GGG GCT ©EC GAG CCT CAG 1534 
lie Pro Lys Gly Ser lie Ala Gin Pro Ala Gly Alai fiteo Gin Pro Gin 

500 505 510 

TCC ACC TGC CCT CCA GCT GTT GCG GGC CCC CTG CCC 2B82C ATG TAC CAG 1582 
Ser Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro* TSSur Met Tyr Gin 

515 520 525 

ATT CCA GAA ATG GCC CGT TTG CCC AGT GTG GCT TTC ©EC ACT GCC ATG 1630 
He Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe fftoo Thr Ala Met 
530 535 540 

ATG CCC CAG CAG GAC GGG CAG GTA GCT CAG ACC AT1P C3TC CCA GCC TAT 1678 
Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr lies ILeu Pro Ala Tyr 
545 550 555 

CAT CCT TTC CCA GCC TCT GTG GGC AAG TAC CCC ACA GDC CCT TCA CAG 1726 
His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr fftoo Pro Ser Gin 
560 565 570 575 

CAC AGT TAT GCT TCC TCA AAT GCT GCT GAG CGA ACA ©DC AGT CAC AGT 1774 
His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr: Piro Ser His Ser 

580 585 590 

GGT CAC CTC CAG GGT GAG CAT CCC TAC CTG ACA CCA TOC CCA GAG TCT 1822 
Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser 

595 600 €05 

CCT GAC CAG TGG TCA AGT TCA TCA CCC CAC TCT GCT TCT GAC TGG TCA 1870 
Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser 
610 615 620 

GAT GTG ACC ACC AGC CCT ACC CCT GGG GGT GCT GGA GGA GGT CAG CGG 1918 
Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gin Arg 
625 630 635 

GGA CCT GGG ACA CAC ATG TCT GAG CCA CCA CAC AAC AAC ATG CAG GTT 1966 
Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Val 
640 645 650 655 

TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 2022 
Tyr Ala 



GCTGAGGAAC AAATGAAGGT CATCCGGGAG AGAAATGAAG AAATCTCTGG AGCCAGCTTC 2082 

TAGAGGTAGG AAAGAGAAGA TGTTCTTATT CAGATAATGC AAGAGAAGCA ATTCGTCAGT 2142 

TTCACTGGGT ATCTGCAAGG CTTATTGATT ATTCTAATCT AATAAGAGAA GTTTGTGGAA 2202 

ATG C AAG ATG AATACAAGCC TTGGGTCCAT GTTTACTCTC TTCTATTTGG AGAATAAGAT 2262 

GGATGCTTAT TGAAGCCCAG ACATTCTTGC AGCTTGGACT GCATTTTAAG CCCTGCAGGC 2322 

TTCTGCCATA TCCATGAGAA GATTCTACAC TAGCGTCCTG TTGGGAATTA TGCCCTGGAA 2382 

TTCTGCCTGA ATTGACCTAC GCATCTCCTC CTCCTTGGAC ATTCTTTTGT CTTCATTTGG 2442 

TGCTTTTGGT TTTGCACCTC TCCGTGATTG TAGCCCTACC AGCATGTTAT AGGGCAAGAC 2502 

CTTTGTGCTT TTGATCATTC TGGCCCATGA AAG C AACTTT GGTCTCCTTT CCCCTCCTGT 2562 

CTTCCCGGTA TCCCTTGGAG TCTCACAAGG TTTACTTTGG TATGGTTCTC AGCACAAACC 2622 
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TTTCAAGTAT GTTGTTTCTT TGGAAAATGG ACATACTGTA TTGTGTTCTC CTGCATATAT 2682 

CATTCCTGGA GAGAGAAGGG GAGAAGAATA CTTTTCTTCA ACAAATTTTG GGGGCAGGAG 2742 

ATCCCTTCAA GAGGCTGCAC CTTAATTTTT CTTGTCTGTG TGCAGGTCTT CATATAAACT 2802 

TTACCAGGAA GAAGGGTGTG AGTTTGTTGT TTTTCTGTGT ATGGGCCTGG TC AGTG T AAA 2862 

GTTTTATCCT TGATAGTCTA GTTACTATGA CCCTCCCCAC * TTTTTT AAAA CCAGAAAAAG 2922 

GTTTGGAATG TTGGAATGAC CAAGAGACAA GTTAACTCGT GCAAGAGCCA GTTACCCACC 2982 

CACAGGTCCC CCTACTTCCT G CCAAGC ATT CCATTGACTG CCTGTATGGA ACACATTTGT 3042 

CCCAGATCTG AG C ATTCT AG GCCTGTTTCA CTCACTCACC CAGCATATGA AACTAGTCTT 3102 

AACTGTTGAG CCTTTCCTTT CATATCCACA GAAGACACTG TCTCAAATGT TGTACCCTTG 3162 

CCATTTAGGA CTGAACTTTC CTT AG CCCAA GGGACCCAGT GACAGTTGTC TTCCGTTTGT 3222 

CAGATGATCA GTCTCTACTG ATTATCTTGC TGCTTAAAGG CCTGCTCACC AATCTTTCTT 3282 

TCACACCGTG TGGTCCGTGT TACTGGTATA CCCAGTATGT TCTCACTGAA GACATGGACT 3342 

TTATATGTTC AAGTGCAGGA ATTGGAAAGT TGGACTTGTT TTCTATGATC CAAAACAGCC 3402 

CTATAAGAAG GTTGGAAAAG GAGGAACTAT ATAGCAGCCT TTGCTATTTT CTGCTACCAT 3462 

TTCTTTTCCT CTGAAGCGGC CATGACATTC CCTTTGGCAA CTAACGTAGA AACTCAACAG 3522 

AACATTTTCC TTTCCTAGAG TCACCTTTTA GATGATAATG GACAACTATA GACTTGCTCA 3582 

TTGTTCAGAC TGATTGCCCC TCACCTGAAT CCACTCTCTG TATTCATGCT CTTGGCAATT 3642 

TCTTTGACTT TCTTTTAAGG GCAGAAGCAT TTTAGTTAAT TGTAGATAAA GAATAGTTTT 3702 

CTTCCTCTTC TCCTTGGGCC AGTTAATAAT TGGTCCATGG CTACACTGCA ACTTCCGTCC 3762 

AGTGCTGTGA TGCCCATGAC ACCTGCAAAA TAAGTTCTGC CTGGG CATTT TGTAGATATT 3822 

AACAGGTGAA TTCCCGACTC TTTTGGTTTG AATGACAGTT CTCATTCCTT CTATGGCTGC 3882 

AAGTATGCAT CAGTGCTTCC CACTTACCTG ATTTGTCTGT CGGTGGCCCC ATATGGAAAC 3942 

CCTGCGTGTC TGTTGG CAT A ATAGTTTACA AATGGTTTTT TCAGTCCTAT CCAAATTTAT 4002 

TGAACCAACA AAAATAATTA CTTCTGCCCT GAGATAAGCA GATTAAGTTT GTTCATTCTC 4062 

TGCTTTATTC TCTCCATGTG GCAACATTCT GTCAGCCTCT TTCATAGTGT GCAAACATTT 4122 

TATCATTCTA AATGGTGACT CTCTGCCCTT GGACCCATTT ATTATTCACA GATGGGGAGA 4182 

ACCTATCTGC ATGGACCCTC ACCATCCTCT GTGCAGCACA CACAGTGCAG GGAGCCAGTG 4242 

GCGATGGCGA TGACTTTCTT CCCCTG 4268 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys Thr 
15 10 15 

Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp 

20 25 30 

Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn He He Thr Asp Leu 

35 40 45 

Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly Glu 
50 55 60 

Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala Lys 
65 70 75 80 

Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met Gly 

85 90 95 

Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val Phe 

100 105 110 

Gin lie Leu lie Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met Asn 
115 120 125 

Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg Leu Ala Val Glu Gly 
130 135 140 

Met Val Ala Glu Leu He Asn Cys Gin Ala Asp Val Asn Ala Val Asp 
145 150 155 160 

Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val 

165 170 175 

Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met Gin 

180 185 190 

Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser 
195 200 205 

Tyr Glu Ala Ala Lys lie Leu Leu Asp His Phe Ala Asn Arg Asp He 
210 215 220 

Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg Met 
225 230 235 240 

His His Asp He Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser 

245 250 255 

Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val He Cys Gly 

260 265 270 

Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys 
275 280 285 

Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn 
290 295 300 

Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys Ser 
305 310 315 320 

Leu Ser Glu Lys. Val Gin Leu Ser Glu Ser Ser Val Thr Leu Ser Pro 

325 330 335 

Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr Ser 

340 345 350 

Ser Pro Met He Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn Pro 
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355 360 365 

Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gin His Ala 
370 375 380 

Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly Ala 
385 390 395 400 

Ser Ttjr Val Leu Pro Ser Val Ser Gin Leu Leu Ser His His His lie 

405 410 415 

Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His Pro 

420 425 430 

Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu Thr 
435 440 445 

Gin Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly Thr 
450 455 460 

His Pro Gly lie Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His lie 
465 470 475 480 

Thr Thr Pro Arg Glu Pro Leu Pro Pro lie Val Thr Phe Gin Leu lie 

485 490 495 

Pro Lys Gly Ser lie Ala Gin Pro Ala Gly Ala Pro Gin Pro Gin Ser 

500 505 510 

Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gin lie 
515 520 525 

Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met 
S30 535 . 540 

Pro Gin Gin Asp Gly Gin Val Ala Gin Thr lie Leu Pro Ala Tyr His 
545 550 555 560 

Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gin His 

565 570 575 

Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser Gly 

580 585 590 

His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser Pro 
595 600 605 

Asp Gin Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser Asp 
610 615 620 

Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gin Arg Gly 
625 630 635 640 

Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Val Tyr 

645 650 655 

Ala 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Asp lie Asp Glu Cys Asp Gin Gly Ser Pro Cys Glu His Asn Gly 

15 10 15 

* 

0 " 

He Cys Val Asn Thr Pro Gly Ser Tyr Arg Cys Asn Cys Ser Gin Gly 

20 25 30 

Phe Thr Gly Pro Arg Cys Glu Thr Asn lie Asn Glu Cys Glu Ser His 
35 40 45 

Pro Cys Gin Asn Glu Gly Ser Cys Leu Asp Asp Pro Gly Thr Phe Arg 
50 55 60 

Cys Val Cys Met Pro Gly Phe Thr Gly Thr Gin Cys Glu 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Asp Val Asp Glu Cys Ser Leu Gly Ala Asn Pro Cys Glu His Gly 
1 5 10 15 

Gly Arg Cys Thr Asn Thr Leu Gly Ser Phe Gin Cys Asn Cys Pro Gin 

20 25 30 

Gly Tyr Ala Gly Pro Arg Cys Glu lie Asp .Val Asn Glu Cys Leu Ser 
35 40 45 

Asn Pro Cys Gin Asn Asp Ser Thr Cys Leu Asp Gin lie Gly Glu Phe 
50 55 60 

Gin Cys lie Cys Met Pro Gly Tyr Glu Gly Leu Tyr Cys Glu 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
Thr Pro Pro Gin Gly Glu lie Glu Ala Asp Cys Met Asp Val Asn Val 



(15 
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15 10 15 

Arg Gly Pro Asp Gly Phe Thr Pro Leu Met lie Ala Ser Cys Ser Gly 

20 25 30 

Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Ser Ala 
35 40 45 

Aen Met lie Ser Asp Phe lie Gly Gin Gly Ala Gin Leu His Asn Gin 
50 55 60 

Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ala 
65 70 75 80 

Arg Ala Asp Ala Ala Lys Arg Leu Leu Glu Ser Ser Ala Asp Ala Asn 

85 90 95 

Val Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ala Ala 

100 105 110 

Asp Ala Gin Gly Val Phe Gin lie Leu lie Arg Asn Arg Ala Thr Asp 
115 120 125 

Leu Asp Ala Arg Met Phe Asp Gly Thr Thr Pro Leu lie Leu Ala Ala 
130 135 140 

Arg Leu Ala Val Glu Gly Met Val Glu Glu Leu lie Asn Ala His Ala 
145 150 155 160 

Asp Val Asn Ala Val Asp Glu Phe Gly Lys Ser Ala Leu His Trp Ala 

165 170 175 

Ala Ala Val Asn Asn Val Asp Ala Ala Ala Val Leu Leu Lys Asn Ser 

180 185 190 

Ala Asn Lys Asp Met Gin Asn Asn Lys Glu Glu Thr Ser Leu Phe Leu 
195 200 205 

Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His 
210 215 220 

Tyr Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp 
225 230 235 240 

lie Ala Gin Glu Arg Met His His Asp lie Val His Leu Leu Asp Glu 

245 250 255 

Tyr Asn Leu Val Lys Ser Pro Thr Leu His Asn Gly Pro Leu Gly Ala 

260 265 270 

Thr Thr Leu Ser Pro Pro lie Cys Ser Pro Asn Gly Tyr Met Gly Asn 
275 280 285 

Met Lys Pro Ser Val Gin Ser Lys Lys Ala Arg Lys Pro Ser lie Lys 
290 295 300 

Gly Asn Gly Cys Lys Glu Ala Lys Glu Leu Lys Ala Arg Arg Lys Lys 
305 310 315 320 

Ser Gin Asp Gly Lys Thr Thr Leu Leu Asp Ser Gly Ser Ser Gly Val 

325 330 335 

Leu Ser Pro Val Asp Ser Leu Glu Ser Thr His Gly Tyr Leu Ser Asp 

340 345 350 

Val Ser Ser Pro Pro Leu Met Thr Ser Pro Phe Gin Gin Ser Pro Ser 
355 360 365 
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Met Pro Leu Asn His Leu Thr Ser Met Pro Glu Ser Gin Leu Gly Met 
370 375 . 380 

Asn His lie Asn Met Ala Thr Lys Gin Glu Met Ala Ala Gly Ser Asn 
385 390 395 400 

Arg Met Ala Phe Asp Ala Met Val Pro Arg Leu Thr His Leu Asn Ala 

405 410 415 

Ser Ser Pro Asn Thr lie Met Ser Asn Gly Ser Met His Phe Thr Val 

420 425 430 

Gly Gly Ala Pro Thr Met Asn Ser Gin Cys Asp Trp Leu Ala Arg Leu 
435 440 445 

Gin Asn Gly Met Val Gin Asn Gin Tyr Asp Pro lie Arg Asn Gly lie 
450 455 460 

Gin Gin Gly Asn Ala Gin Gin Ala Gin Ala Leu Gin His Gly Leu Met 
465 470 475 480 

Thr Ser Leu His Asn Gly Leu Pro Ala Thr Thr Leu Ser Gin Met Met 

485 490 495 

Thr Tyr Gin Ala Met Pro Asn Thr Arg Leu Ala Asn Gin Pro His Leu 

500 505 510 

Met Gin Ala Gin Gin Met Gin Gin Gin Gin Asn Leu Gin Leu His Gin 
515 520 525 

Ser Met Gin Gin Gin His His Asn Ser Ser Thr Thr Ser Thr His lie 
530 535 540 

Asn Ser Pro Phe Cys Ser Ser Asp lie Ser Gin Thr Asp Leu Gin Gin 
545 550 555 560 

Met Ser Ser Asn Asn lie His Ser Val Met Pro Gin Asp Thr Gin lie 

565 570 575 

Phe Ala Ala Ser Leu Pro Ser Asn Leu Thr Gin Ser Met Thr Thr Ala 

580 585 590 

Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro Met Asp 
595 600 605 

Asn Thr Pro Ser His Gin Leu Gin Val Pro Asp His Pro Phe Leu Thr 
610 615 620 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser 
625 630 635 640 

Asn Met Ser Asp Trp Ser Glu Gly lie Ser Ser Pro Pro Thr 

645 650 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
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Thr Pro Pro Gin Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val 
15 10 15 

Arg Gly Pro Asp Gly Phe Thr Pro Leu Met lie Ala Ser Cys Ser Gly 

20 25 30 

Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro Ala 
35 40 45 



* 



Val lie Ser Asp Phe lie Tyr Gin Gly Ala Ser Leu His Asn Gin Thr 
50 55 60 

Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ser Arg 
65 70 75 80 

Ser Asp Ala Ala Lye Arg Leu Leu Glu Ala Ser Ala Asp Ala Asn lie 

85 90 95 

Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp 

100 105 110 

Ala Gin Gly Val Phe Gin lie Leu Leu Arg Asn Arg Ala Thr Asp Leu 
115 120 125 

Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg 
130 135 140 

Leu Ala Val Glu Gly Met Leu Glu Asp Leu lie Asn Ser His Ala Asp 
145 150 155 160 

Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 

165 170 175 

Ala Val Asn Asn Val Asp Ala Ala Val Val Leu Leu Lys Asn Gly Ala 

180 185 190 

Asn Lys Asp Met Gin Asn Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala 
195 200 205 

Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 
210 215 220 

Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp lie 
225 230 235 240 

Ala Gin Glu Arg Met His His Asp lie Val Arg Leu Leu Asp Glu Tyr 

245 250 255 

Asn Leu Val Arg Ser Pro Gin Leu His Gly Thr Ala Leu Gly Gly Thr 

260 265 270 

Pro Thr Leu Ser Pro Thr Leu Cys Ser Pro Asn Gly Tyr Leu Gly Asn 
275 280 285 

Leu Lys Ser Ala Thr Gin Gly Lys Lys Ala Arg Lys Pro Ser Thr Lys 
290 295 300 

Gly Leu Ala Cys Ser Ser Lys Glu Ala Lys Asp Leu Lys Ala Arg Arg 
305 310 315 320 

Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu Leu Asp Ser Ser Ser Met 

325 330 335 

Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Gly Tyr Leu Ser Asp 

340 345 350 

Val Ala Ser Pro Pro Leu Pro Ser Pro Phe Gin Gin Ser Pro Ser Met 



V 
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355 360 365 

Pro Leu Ser His Leu Pro Gly Met Pro Asp Thr His Leu Gly lie Ser 
370 375 380 

His Leu Asn Val Ala Ala Lys Pro Glu Met Ala Ala Leu Ala Gly Gly 
385 390 395 400 

Ser Arg Leu Ala Phe Glu Pro Pro Pro Pro Arg Leu Ser His Leu Pro 

405 410 415 

Val Ala Ser Ser Ala Ser Thr Val Leu Ser Thr Asn Gly Thr Gly Ala 

420 425 430 

Met Asn Phe Thr Val Gly Ala Pro Ala Ser Leu Asn Gly Gin Cys Glu 
435 440 445 

Trp Leu Pro Arg Leu Gin Asn Gly Met Val Pro Ser Gin Tyr Asn Pro 
450 455 460 

Leu Arg Pro Gly Val Thr Pro Gly Thr Leu Ser Thr Gin Ala Ala Gly 
465 470 475 480 

Leu Gin His Gly Met Met Ser Pro lie His Ser Ser Leu Ser Thr Asn 

485 490 495 

Thr Leu Ser Pro lie lie Tyr Gin Gly Leu Pro Asn Thr Arg Leu Ala 

500 505 510 

Thr Gin Pro His Leu Val Gin Thr Gin Gin Val Gin Pro Gin Asn Leu 
515 520 525 

Gin lie Gin Pro Gin Asn Leu Gin Pro Pro Ser Gin Pro His Leu Ser 
530 535 540 

Val Ser Ser Ala Ala Asn Gly His Leu Gly Arg Ser Phe Leu Ser Gly 
545 550 555 560 

Glu Pro Ser Gin Ala Asp Val Gin Pro Leu Gly Pro Ser Ser Leu Pro 

565 570 575 

Val His Thr lie Leu Pro Gin Glu Ser Gin Ala Leu Pro Thr Ser Leu 

580 585 590 

Pro Ser Ser Met Val Pro Pro Met Thr Thr Thr Gin Phe Leu Thr Pro 
595 600 605 

Pro Ser Gin His Ser Tyr Ser Ser Ser Pro Val Asp Asn Thr Pro Ser 
610 615 620 

His Gin Leu Gin Val Pro Glu His Pro Phe Leu Thr Pro Ser Pro Glu 
625 630 635 640 

Ser Pro Asp Gin Trp Ser Ser Ser Ser Arg His Ser Asn lie Ser Asp 

645 650 655 

Trp Ser Glu Gly lie Ser Ser Pro Pro Thr 

660 665 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Thr Pro Pro Gin Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val 
15 10 15 



Arg Gly Pro Asp Gly Phe Thr Pro Leu Met lie Ala Ser Cys Ser Gly 

20 25 30 

Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro Ala 
35 40 45 

Val lie Ser Asp Phe lie Tyr Gin Gly Ala Ser Leu His Asn Gin Thr 
50 55 60 

Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ser Arg 
65 70 75 80 

Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser Ala Asp Ala Asn lie 

85 90 95 

Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp 

100 105 110 

Ala Gin Gly Val Phe Gin lie Leu lie Arg Asn Arg Ala Thr Asp Leu 
115 120 125 

Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg 
130 135 140 

Leu Ala Val Glu Gly Met Leu Glu Asp Leu lie Asn Ser His Ala Asp 
145 150 155 160 

Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 

165 170 175 

Ala Val Asn Asn Val Asp Ala Ala Val Val Leu Leu Lys Asn Gly Ala 

180 185 190 

Asn Lys Asp Met Gin Asn Asn Arg Glu Glu Thr Pro Leu Phe Leu Ala 
195 200 205 

Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 
210 215 220 

Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp lie 
225 230 235 240 

Ala Gin Glu Arg Met His His Asp lie Val Arg Leu Leu Asp Glu Tyr 

245 250 255 

Asn Leu Val Arg Ser Pro Gin Leu His Gly Ala Pro Leu Gly Gly Thr 

260 265 270 

Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro Asn Gly Tyr Leu Gly Ser 
275 280 285 

Leu Lys Pro Gly Val Gin Gly Lys Lys Val Arg Lys Pro Ser Ser Lys 
290 295 300 

Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp Leu Lys Ala Arg Arg 
305 310 315 320 

Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu Leu Asp Ser Ser Gly Met 
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325 330 335 

Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Gly Tyr Leu Ser Asp 

340 345 350 

Val Ala Ser Pro Pro Leu Leu Pro Ser Pro Phe Gin Gin Ser Pro Ser 
355 360 365 

Val Pro Leu Asn His Leu Pro Gly Met Pro Asp Thr His Leu Gly lie 
370 375 380 

Gly His Leu Asn Val Ala Ala Lys Pro Glu Met Ala Ala Leu Gly Gly 
385 390 395 400 

Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro Pro Arg Leu Ser His Leu 

405 410 415 

Pro Val Ala Ser Gly Thr Ser Thr Val Leu Gly Ser Ser Ser Gly Gly 

420 425 430 

Ala Leu Asn Phe Thr Val Gly Gly Ser Thr Ser Leu Asn Gly Gin Cys 
435 440 445 

Glu Trp Leu Ser Arg Leu Gin Ser Gly Met Val Pro Asn Gin Tyr Asn 
450 455 460 

Pro Leu Arg Gly Ser Val Ala Pro Gly Pro Leu Ser Thr Gin Ala Pro 
465 470 475 480 

Ser Leu Gin His Gly Met Val Gly Pro Leu His Ser Ser Leu Ala Ala 

485 490 495 

Ser Ala Leu Ser Gin Met Met Ser Tyr Gin Gly Leu Pro Ser Thr Arg 

500 505 510 

Leu Ala Thr Gin Pro His Leu Val Gin Thr Gin Gin Val Gin Pro Gin 
515 520 525 

Asn Leu Gin Met Gin Gin Gin Asn Leu Gin Pro Ala Asn lie Gin Gin 
530 535 540 

Gin Gin Ser Leu Gin Pro Pro Pro Pro Pro Pro Gin Pro His Leu Gly 
545 550 555 560 

Val Ser Ser Ala Ala Ser Gly His Leu Gly Arg Ser Phe Leu Ser Gly 

565 570 575 

Glu Pro Ser Gin Ala Asp Val Gin Pro Leu Gly Pro Ser Ser Leu Ala 

580 585 590 

Val His Thr lie Leu Pro Gin Glu Ser Pro Ala Leu Pro Thr Ser Leu 
595 600 605 

Pro Ser Ser Leu Val Pro Pro Val Thr Ala Ala Gin Phe Leu Thr Pro 
610 615 620 

Pro Ser Gin His Ser Tyr Ser Ser Pro Val Glu Asn Thr Pro Ser His 
625 630 635 640 

Gin Leu Gin Val Pro Glu His Pro Phe Leu Thr Pro Ser Pro Glu Ser 

645 650 655 

Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser Asn Val Ser Asp Trp 

660 665 670 

Ser Glu Gly Val Ser Ser Pro Pro Thr 
675 680 



121- 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2471 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp 
1 5 10 15 

Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gin Cys Arg Asp Gly Tyr 

20 25 30 

Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr 
35 40 45 

Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gin His 
50 55 60 

Arg Asp Pro Cys Glu Lys Asn Arg Cys Gin Asn Gly Gly Thr Cys Val 
65 70 75 80 

Ala Gin Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe 

85 90 95 

Thr Gly Glu Asp Cys Gin Tyr Ser Thr Ser His Pro Cys Phe Val Ser 

100 105 110 

Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr 
115 120 125 

Tyr Glu Cys Thr Cys Gin Val Gly Phe thr Gly Lys Glu Cys Gin Trp 
130 135 140 

Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr 
145 150 155 160 

Thr Val Ala Asn Gin Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly 

165 170 175 

Gin Lys Cys Glu Thr Asp Val Asn Glu Cys Asp lie Pro Gly His Cys 

180 185 190 

Gin His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gin Cys Gin 
195 200 205 

Cys Pro Gin Gly Phe Thr Gly Gin Tyr Cys Asp Ser Leu Tyr Val Pro 
210 215 220 

Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gin Thr Gly 
225 230 235 240 

Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr 

245 250 255 

Cys Glu Arg Asn lie Asp Asp Cys Pro Asn His Arg Cys Gin Asn Gly 

260 265 270 

Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro 
275 280 285 
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Gln Trp Thr Gly Gin Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu 
290 295 300 

Gin Pro Asn Ala Cys Gin Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly 
305 310 315 320 

Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser 

325 330 335 

♦ 

Glu Asn lie Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr 

340 345 350 

Cys lie Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys 
355 360 365 

Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys lie Ser Asn Pro Cys 
370 375 380 

His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gin Tyr lie 
385 390 395 400 

Cys Thr Cys Pro Gin Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val 

405 410 415 

Asp Glu Cys Ala Met: Ala Asn Ser Asn Pro Cys Glu His Ala Gly Lys 

420 425 430 

Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys Gly Tyr 
435 440 445 

Ala Gly Pro Arg Cys Glu Met: Asp lie Asn Glu Cys His Ser Asp Pro 
450 455 460 

Cys Gin Asn Asp Ala Thr Cys Leu Asp Lys lie Gly Gly Phe Thr Cys 
465 470 475 480 

Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu lie Asn 

485 490 495 

Glu Cys Gin Ser Asn Pro Cys Val Asn Asn Gly Gin Cys Val Asp Lys 

500 505 510 

Val Asn Arg Phe Gin Cys Leu Cys Pro Pro Gly Phe Thr Gly Pro Val 
515 520 525 

Cys Gin lie Asp lie Asp Asp Cys Ser Ser Thr Pro Cys Leu Asn Gly 
530 535 540 

Ala Lys Cys lie Asp His Pro Asn Gly Tyr Glu Cys Gin Cys Ala Thr 
545 550 555 560 

Gly Phe Thr Gly Val Leu Cys Glu Glu Asn lie Asp Asn Cys Asp Pro 

565 570 575 

Asp Pro Cys His His Gly Gin Cys Gin Asp Gly lie Asp Ser Tyr Thr 

580 585 590 

Cys lie Cys Asn Pro Gly Tyr Met Gly Ala lie Cys Ser Asp Gin lie 
595 600 605 

Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys lie Asp 
610 615 620 

Leu Val Asn Gly Tyr Gin Cys Asn Cys Gin Pro Gly Thr Ser Gly Val 
625 630 635 640 

Asn Cys Glu lie Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys lie His 
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645 



650 



655 



Gly lie 

Gly Phe 

Aon Pro 
690 

Arg Cys 
705 

Val Asn 

Gly Leu 

Asn Cys 

Gly Gly 
770 

Lys Gly 
785 

Ser Asn 

Tyr Thr 

Val Leu 

Lys Glu 
850 

Trp Gin 
865 

Pro Cys 

Cys Glu 

Asp Asp 

Gly Val 
930 

Lys Cys 
945 

Gly Gly 
Ala Gly 
Glu Ser 



Cys Met: Asp Gly lie Asn Arg Tyr Ser Cys Val Cys Ser Pro 
660 665 670 

Thr Gly Gin Arg Cys Asn lie Asp lie Asp Glu Cys Ala Ser 
675 680 685 

Cys Arg Lys Gly Ala Thr Cys lie Asn Gly Val Asn Gly Phe 

695 700 

lie Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr Ser Gin 

710 715 720 

Glu Cys Leu Ser Asn Pro Cys He His Gly Asn Cys Thr Gly 
725 730 735 

Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val Gly He 
740 745 750 

Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys Gin Asn 
755 760 765 

Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys 

775 780 

Phe Lys Gly Tyr Asn Cys Gin Val Asn He Asp Glu Cys Ala 

790 795 800 

Pro Cys Leu Asn Gin Gly Thr Cys Phe Asp Asp lie Ser Gly 
805 810 815 

Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys Gin Thr 
820 825 830 

Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala Val Cys 
835 840 845 

Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly 

855 860 

Gly Gin Arg Cys Thr He Asp He Asp Glu Cys He Ser Lys 

870 875 880 

Met Asn His Gly Leu Cys His Asn Thr Gin Gly Ser Tyr Met 
885 890 895 

Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu Asp He 
900 905 910 

Cys Leu Ala Asn Pro Cys Gin Asn Gly Gly Ser Cys Met Asp 
915 920 925 

Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp 

935 940 

Gin Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn 

950 955 960 

Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gin 
965 970 975 

Phe Asp Gly Val His Cys Glu Asn Asn He Asn Glu Cys Thr 
980 985 990 

Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly He Asn Ser 
995 1000 1005 
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Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys Leu His 
1010 1015 1020 

Glu lie Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly Thr Cys 
1025 1030 1035 1040 

Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu Gly Tyr Thr 

1045 1050 1055 



Gly Lys Asn Cys Gin Thr Leu Val Asn Leu Cys Ser Arg Ser Pro Cys 

1060 1065 1070 

Lys Asn Lys Gly Thr Cys Val Gin Lys Lys Ala Glu Ser Gin Cys Leu 
1075 1080 1085 

Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys Asp Val Pro Asn Val Ser 
1090 1095 1100 

Cys Asp lie Ala Ala Ser Arg Arg Gly Val Leu Val Glu His Leu Cys 
1105 1110 1115 1120 

Gin His Ser Gly Val Cys lie Asn Ala Gly Asn Thr His Tyr Cys Gin 

1125 1130 1135 

Cys Pro Leu Gly Tyr Thr Gly Ser Tyr Cys Glu Glu Gin Leu Asp Glu 

1140 1145 1150 

Cys Ala Ser Asn Pro Cys Gin His Gly Ala Thr Cys Ser Asp Phe lie 
1155 1160 1165 

Gly Gly Tyr Arg Cys Glu Cys Val Pro Gly Tyr Gin Gly Val Asn Cys 
1170 1175 1180 

Glu Tyr Glu Val Asp Glu Cys Gin Asn Gin Pro Cys Gin Asn Gly Gly 
1185 1190 1195 1200 

Thr Cys lie Asp Leu Val Asn His Phe Lys Cys Ser Cys Pro Pro Gly 

1205 1210 1215 

Thr Arg Gly Leu Leu Cys Glu Glu Asn lie Asp Asp Cys Ala Arg Gly 

1220 1225 1230 

Pro His Cys Leu Asn Gly Gly Gin Cys Met Asp Arg lie Gly Gly Tyr 
1235 1240 1245 

Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu Gly Asp 
1250 1255 1260 

lie Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Glu Gly Ser Leu Asp 
1265 1270 1275 1280 

Cys lie Gin Leu Thr Asn Asp Tyr Leu Cys Val Cys Arg Ser Ala Phe 

1285 1290 1295 

Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys Pro Gin Met Pro 

1300 1305 1310 

Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser Asn Met Pro Asp Gly 
1315 1320 1325 

Phe lie Cys Arg Cys Pro Pro Gly Phe Ser Gly Ala Arg Cys Gin Ser 
1330 1335 1340 

Ser Cys Gly Gin Val Lys Cys Arg Lys Gly Glu Gin Cys Val His Thr 
1345 1350 1355 1360 

Ala Ser Gly Pro Arg Cys Phe Cys Pro Ser Pro Arg Asp Cys Glu Ser 




# 
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1365 1370 1375 

Gly Cys Ala Ser Ser Pro Cys Gin His Gly Gly Ser Cys His Pro Gin 

1380 1385 1390 

Arg Gin Pro Pro Tyr Tyr Ser Cys Gin Cys Ala Pro Pro Phe Ser Gly 
1395 1400 1405 

Ser Arg Cys Glu Leu Tyr Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr 
1410 1415 1420 

Cys Leu Ser Gin Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val Cys Asp 
1425 1430 1435 1440 

Glu Ala Cys Asn Ser His Ala Cys Gin Trp Asp Gly Gly Asp Cys Ser 

1445 1450 1455 

Leu Thr Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys 

1460 1465 1470 

Trp Asp Tyr lie Asn Asn Gin Cys Asp Glu Leu Cys Asn Thr Val Glu 
1475 1480 1485 

Cys Leu Phe Asp Asn Phe Glu Cys Gin Gly Asn Ser Lys Thr Cys Lys 
1490 1495 1500 

Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys Asn Gin 
1505 1510 1515 1520 

Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys Ala Ala 

1525 1530 1535 

Asp Gin Pro Glu Asn Leu Ala Glu Gly Thr Leu Val lie Val Val Leu 

1540 1545 1550 

Met Pro Pro Glu Gin Leu Leu Gin Asp Ala Arg Ser Phe Leu Arg Ala 
1555 1560 1565 

Leu Gly Thr Leu Leu His Thr Asn Leu Arg lie Lys Arg Asp Ser Gin 
1570 1575 1580 

Gly Glu Leu Met Val Tyr Pro Tyr Tyr Gly Glu Lys Ser Ala Ala Met 
1585 1590 1595 1600 

Lys Lys Gin Arg Met Thr Arg Arg Ser Leu Pro Gly Glu Gin Glu Gin 

1605 1610 1615 

Glu Val Ala Gly Ser Lys Val Phe Leu Glu lie Asp Asn Arg Gin Cys 

1620 1625 1630 

Val Gin Asp Ser Asp His Cys Phe Lys Asn Thr Asp Ala Ala Ala Ala 
1635 1640 1645 

Leu Leu Ala Ser His Ala lie Gin Gly Thr Leu Ser Tyr Pro Leu Val 
1650 1655 1660 

Ser Val Val Ser Glu Ser Leu Thr Pro Glu Arg Thr Gin Leu Leu Tyr 
1665 1670 1675 1680 

Leu Leu Ala Val Ala Val Val lie lie Leu Phe lie lie Leu Leu Gly 

1685 1690 1695 

Val lie Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro 

1700 1705 1710 

Glu Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu 
1715 1720 1725 
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Pro Val Gly Gin Asp Ala Val Gly Leu Lys Asn Leu Ser Val Gin Val 
1730 1735 1740 

Ser Glu Ala Asn Leu lie Gly Thr Gly Thr Ser Glu His Trp Val Asp 
1745 1750 1755 1760 

Asp Glu Gly Pro Gin Pro Lys Lys Val Lys Ala Glu Asp Glu Ala Leu 

1765 1770 1775 

Leu Ser Glu Glu Asp Asp Pro lie Asp Arg Arg Pro Trp Thr Gin Gin 

1780 1785 1790 

His Leu Glu Ala Ala Asp lie Arg Arg Thr Pro Ser Leu Ala Leu Thr 
1795 1800 1805 

Pro Pro Gin Ala Glu Gin Glu Val Asp Val Leu Asp Val Asn Val Arg 
1810 1815 1820 

Gly Pro Asp Gly Cys Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly 
1825 1830 1835 1840 

Ser Ser Asp Leu Ser Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala 

1845 1850 1855 

Asn He lie Thr Asp Leu Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin 

1860 1865 1870 

Thr Asp Arg Thr Gly Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser 
1875 1880 1885 

Arg Ala Asp Ala Ala Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn 
1890 1895 1900 

Ala Gin Asp Asn Met Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala 
1905 1910 1915 1920 

Asp Ala Gin Gly Val Phe Gin lie Leu He Arg Asn Arg Val Thr Asp 

1925 1930 1935 

Leu Asp Ala Arg Met Asn Asp Gly Thr Thr Pro Leu He Leu Ala Ala 

1940 1945 1950 

Arg Leu Ala Val Glu Gly Met Val Ala Glu Leu lie Asn Cys Gin Ala 
1955 I960 1965 

Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp Ala 
1970 1975 1980 

Ala Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly 
1985 1990 1995 2000 

Ala Asn Arg Asp Met Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu 

2005 2010 2015 

Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys He Leu Leu Asp His 

2020 2025 2030 

Phe Ala Asn Arg Asp He Thr Asp His Met Asp Arg Leu Pro Arg Asp 
2035 2040 2045 

Val Ala Arg Asp Arg Met His His Asp He Val Arg Leu Leu Asp Glu 
2050 2055 2060 

Tyr Asn Val Thr Pro Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu 
2065 2070 2075 2080 

Ser Pro Val He Cys Gly Pro Asn Arg Ser Phe Leu Ser Leu Lys His 
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2085 2090 2095 

Thr Pro Met Gly Lys LysSer Arg Arg Pro Ser Ala Lys Ser Thr Met 

2100 2105 2110 

Pro Thr Ser Leu Pro Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly 
2115 2120 2125 

Ser Arg Arg Lys Lys Ser Leu Ser Glu Lys Val Gin Leu Ser Glu Ser 
2130 2135 2140 

Ser Val Thr Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr 
2145 2150 2155 2160 

Val Ser Asp Thr Thr Ser Ser Pro Met lie Thr Ser Pro Gly lie Leu 

2165 2170 2175 

Gin Ala Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro 

2180 2185 2190 

Val His Ala Gin His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin 
2195 2200 2205 

Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gin Leu 
2210 2215 2220 

Leu Ser His His His lie Val Ser Pro Gly Ser Gly Ser Ala Gly Ser 
2225 2230 2235 2240 

Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg 

2245 2250 2255 

Met Glu Val Asn Glu Thr Gin Tyr Asn Glu Met Phe Gly Met Val Leu 

2260 2265 2270 

Ala Pro Ala Glu Gly Thr His Pro Gly lie Ala Pro Gin Ser Arg Pro 
2275 2280 2285 

Pro Glu Gly Lys His lie Thr Thr Pro Arg Glu Pro Leu Pro Pro lie 
2290 2295 2300 

Val Thr Phe Gin Leu lie Pro Lys Gly Ser lie Ala Gin Pro Ala Gly 
2305 2310 2315 2320 

Ala Pro Gin Pro Gin Ser Thr Cys Pro Pro Ala Val Ala Gly Pro Leu 

2325 2330 2335 

Pro Thr Met Tyr Gin lie Pro Glu Met Ala Arg Leu Pro Ser Val Ala 

2340 2345 2350 

Phe Pro Thr Ala Met Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr 
2355 2360 2365 

lie Leu Pro Ala Tyr His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro 
2370 2375 2380 

Thr Pro Pro Ser Gin His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg 
2385 2390 2395 2400 

Thr Pro Ser His Ser Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr 

2405 2410 2415 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser 

2420 2425 2430 

Ala Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala 
2435 2440 2445 
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Gly Gly Gly Gin Arg Gly Pro Gly Thr His Met Ser Glu Pro Pro Hii 
2450 2455 2460 

Asn Asn Met Gin Val Tyr Ala 
2465 2470 

(2) INFORMATION FOR SEQ ID NO: 20: 

(±) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2556 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Pro Pro Leu Leu Ala Pro Leu Leu Cys Leu Ala Leu Leu Pro Ala 
1 5 10 15 

Leu Ala Ala Arg Gly Pro Arg Cys Ser Gin Pro Gly Glu Thr Cys Leu 

20 25 30 

Asn Gly Gly Lys Cys Glu Ala Ala Asn Gly Thr Glu Ala Cys Val Cys 
35 40 45 

Gly Gly Ala Phe Val Gly Pro Arg Cys Gin Asp Pro Asn Pro Cys Leu 
50 55 60 

Ser Thr Pro Cys Lys Asn Ala Gly Thr Cys His Val Val Asp Arg Arg 
65 70 75 80 

Gly Val Ala Asp Tyr Ala Cys Ser Cys Ala Leu Gly Phe Ser Gly Pro 

85 90 95 

Leu Cys Leu Thr Pro Leu Asp Asn Ala Cys Leu Thr Asn Pro Cys Arg 

100 105 110 

Asn Gly Gly Thr Cys Asp Leu Leu Thr Leu Thr Glu Tyr Lys Cys Arg 
115 120 125 

Cys Pro Pro Gly Trp Ser Gly Lys Ser Cys Gin Gin Ala Asp Pro Cys 
130 135 140 

Ala Ser Asn Pro Cys Ala Asn Gly Gly Gin Cys Leu Pro Phe Glu Ala 
145 150 155 160 

Ser Tyr lie Cys His Cys Pro Pro Ser Phe His Gly Pro Thr Cys Arg 

165 170 175 

Gin Asp Val Asn Glu Cys Gly Gin Lys Pro Arg Leu Cys Arg His Gly 

180 185 190 

Gly Thr Cys His Asn Glu Val Gly Ser Tyr Arg Cys Val Cys Arg Ala 
195 200 205 

Thr His Thr Gly Pro Asn Cys Glu Arg Pto Tyr Val Pro Cys Ser Pro 
210 215 220 

Ser Pro Cys Gin Asn Gly Gly Thr Cys Arg Pro Thr Gly Asp Val Thr 
225 230 235 240 

His Glu Cys Ala Cys Leu Pro Gly Phe Thr Gly Gin Asn Cys Glu Glu 

245 250 255 
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Asn He Asp Asp Cys Pro Gly Asn Asn Cys Lys Asn Gly Gly Ala Cys 

260 265 270 

Val Asp Gly Val Asn Thr Tyr Asn Cys Pro Cys Pro Pro Glu Trp Thr 
275 280 285 

Gly Gin Tyr Cys Thr Glu Asp Val Asp Glu Cys Gin Leu Met: Pro Asn 
290 295 300 

Ala Cys Gin Asn Gly Gly Thr Cys His Asn Thr His Gly Gly Tyr Asn 
305 310 315 320 

Cys Val Cys Val Asn Gly Trp Thr Gly Glu Asp Cys Ser Glu Asn He 

325 330 335 

Asp Asp Cys Ala Ser Ala Ala Cys Phe His Gly Ala Thr Cys His Asp 

340 345 350 

Arg Val Ala Ser Phe Tyr Cys Glu Cys Pro His Gly Arg Thr Gly Leu 
355 360 365 

Leu Cys His Leu Asn Asp Ala Cys lie Ser Asn Pro Cys Asn Glu Gly 
370 375 380 

Ser Asn Cys Asp Thr Asn Pro Val Asn Gly Lys Ala He Cys Thr Cys 
385 390 395 400 

Pro Ser Gly Tyr Thr Gly Pro Ala Cys Ser Gin Asp Val Asp Glu Cys 

405 410 415 

Ser Leu Gly Ala Asn Pro Cys Glu His Ala Gly Lys Cys He Asn Thr 

420 425 430 

Leu Gly Ser Phe Glu Cys Gin Cys Leu Gin Gly Tyr Thr Gly Pro Arg 
435 440 445 

Cys Glu He Asp Val Asn Glu Cys Val Ser Asn Pro Cys Gin Asn Asp 
450 455 460 

Ala Thr Cys Leu Asp Gin He Gly Glu Phe Gin Cys Met Cys Met: Pro 
465 470 475 480 

Gly Tyr Glu Gly Val His Cys Glu Val Asn Thr Asp Glu Cys Ala Ser 

485 490 495 

Ser Pro Cys Leu His Asn Gly Arg Cys Leu Asp Lys He Asn Glu Phe 

500 505 510 

Gin Cys Glu Cys Pro Thr Gly Phe Thr Gly His Leu Cys Gin Tyr Asp 
515 520 525 

Val Asp Glu Cys Ala Ser Thr Pro Cys Lys Asn Gly Ala Lys Cys Leu 
530 535 540. 

Asp Gly Pro Asn Thr Tyr Thr Cys Val Cys Thr Glu Gly Tyr Thr Gly 
545 550 555 560 

Thr His Cys Glu Val Asp He Asp Glu Cys Asp Pro Asp Pro Cys His 

565 570 575 

Tyr Gly Ser Cys Lys Asp Gly Val Ala Thr Phe Thr Cys Leu Cys Arg 

580 585 590 

Pro Gly Tyr Thr Gly His His Cys Glu Thr Asn He Asn Glu Cys Ser 
595 600 605 

Ser Gin Pro Cys Arg Leu Arg Gly Thr Cys Gin Asp Pro Asp Asn Ala 
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610 615 620 

Tyr Leu Cys Phe Cys Leu Lys Gly Thr Thr Gly Pro Asn Cys Glu lie 
625 630 635 640 

Asn Leu Asp Asp Cys Ala Ser Ser Pro Cys Asp Ser Gly Thr Cys Leu 

645 650 655 

Asp Lys lie Asp Gly Tyr Glu Cys Ala Cys Glu Pro Gly Tyr Thr Gly 

660 665 670 

Ser Met: Cys Asn Ser Asn lie Asp Glu Cys Ala Gly Asn Pro Cys His 
675 680 685 

Asn Gly Gly Thr Cys Glu Asp Gly He Asn Gly Phe Thr Cys Arg Cys 
690 695 700 

Pro Glu Gly Tyr His Asp Pro Thr Cys Leu Ser Glu Val Asn Glu Cys 
705 710 715 720 

Asn Ser Asn Pro Cys Val His Gly Ala Cys Arg Asp Ser Leu Asn Gly 

725 730 735 

Tyr Lys Cys Asp Cys Asp Pro Gly Trp Ser Gly Thr Asn Cys Asp lie 

740 745 750 

Asn Asn Asn Glu Cys Glu Ser Asn Pro Cys Val Asn Gly Gly Thr Cys 
755 760 765 

Lys Asp Met Thr Ser Gly lie Val Cys Thr Cys Arg Glu Gly Phe Ser 
770 775 780 

Gly Pro Asn Cys Gin Thr Asn lie Asn Glu Cys Ala Ser Asn Pro Cys 
785 790 795 800 

Leu Asn Lys Gly Thr Cys lie Asp Asp Val Ala Gly Tyr Lys Cys Asn 

805 810 815 

Cys Leu Leu Pro Tyr Thr Gly Ala Thr Cys Glu Val Val Leu Ala Pro 

820 825 830 

Cys Ala Pro Ser Pro Cys Arg Asn Gly Gly Glu Cys Arg Gin Ser Glu 
835 840 845 

Asp Tyr Glu Ser Phe Ser Cys Val Cys Pro Thr Ala Gly Ala Lys Gly 
850 855 860 

Gin Thr Cys Glu Val Asp lie Asn Glu Cys Val Leu Ser Pro Cys Arg 
865 870 875 880 

His Gly Ala Ser Cys Gin Asn Thr His Gly Gly Tyr Arg Cys His Cys 

885 890 895 

Gin Ala Gly Tyr Ser Gly Arg Asn Cys Glu Thr Asp lie Asp Asp Cys 

900 905 910 

Arg Pro Asn Pro Cys His Asn Gly Gly Ser Cys Thr Asp Gly lie Asn 
915 920 925 

Thr Ala Phe Cys Asp Cys Leu Pro Gly Phe Arg Gly Thr Phe Cys Glu 
930 935 940 

Glu Asp lie Asn Glu Cys Ala Ser Asp Pro. Cys Arg Asn Gly Ala Asn 
945 950 955 960 

Cys Thr Asp Cys Val Asp Ser Tyr Thr Cys Thr Cys Pro Ala Gly Phe 

965 970 975 
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Ser Gly lie His Cys Glu Asn Asn Thr Pro Asp Cys Thr Glu Ser Ser 

980 985 990 

Cys Phe Asn Gly Gly Thr Cys Val Asp Gly lie Asn Ser Phe Thr Cys 
995 1000 1005 

Leu Cys Pro Pro Gly Phe Thr Gly Ser Tyr Cys Gin His Val Val Asn 
1010 1015 1020 

Glu Cys Asp Ser Arg Pro Cys Leu Leu Gly Gly Thr Cys Gin Asp Gly 
1025 1030 1035 1040 

Arg Gly Leu His Arg Cys Thr Cys Pro Gin Gly Tyr Thr Gly Pro Asn 

1045 1050 1055 

Cys Gin Asn Leu Val His Trp Cys Asp Ser Ser Pro Cys Lys Asn Gly 

1060 1065 1070 

Gly Lys Cys Trp Gin Thr His Thr Gin Tyr Arg Cys Glu Cys Pro Ser 
1075 1080 1085 

Gly Trp Thr Gly Leu Tyr Cys Asp Val Pro Ser Val Ser Cys Glu Val 
1090 1095 1100 

Ala Ala Gin Arg Gin Gly Val Asp Val Ala Arg Leu Cys Gin His Gly 
1105 1110 1115 1120 

Gly Leu Cys Val Asp Ala Gly Asn Thr His His Cys Arg Cys Gin Ala 

1125 1130 1135 

Gly Tyr Thr Gly Ser Tyr Cys Glu Asp Leu Val Asp Glu Cys Ser Pro 

1140 1145 1150 

Ser Pro Cys Gin Asn Gly Ala Thr Cys Thr Asp Tyr Leu Gly Gly Tyr 
1155 1160 1165 

Ser Cys Lys Cys Val Ala Gly Tyr His Gly Val Asn Cys Ser Glu Glu 
1170 1175 1180 

lie Asp Glu Cys Leu Ser His Pro Cys Gin Asn Gly Gly Thr Cys Leu 
1185 1190 1195 1200 

Asp Leu Pro Asn Thr Tyr Lys Cys Ser Cys Pro Arg Gly Thr Gin Gly 

1205 1210 1215 

Val His Cys Glu lie Asn Val Asp Asp Cys Asn Pro Pro Val Asp Pro 

1220 1225 1230 

Val Ser Arg Ser Pro Lys Cys Phe Asn Asn Gly Thr Cys Val Asp Gin 
1235 1240 1245 

Val Gly Gly Tyr Ser Cys Thr Cys Pro Pro Gly Phe Val Gly Glu Arg 
1250 1255 1260 

Cys Glu Gly Asp Val Asn Glu Cys Leu Ser Asn Pro Cys Asp Ala Arg 
1265 1270 1275 1280 

Gly Thr Gin Asn Cys Val Gin Arg Val Asn Asp Phe His Cys Glu Cys 

1285 1290 1295 

Arg Ala Gly His Thr Gly Arg Arg Cys Glu Ser Val lie Asn Gly Cys 

1300 1305 1310 

Lys Gly Lys Pro Cys Lys Asn Gly Gly Thr Cys Ala Val Ala Ser Asn 
1315 1320 1325 

Thr Ala Arg Gly Phe lie Cys Lys Cys Pro Ala Gly Phe Glu Gly Ala 
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1330 1335 1340 

Thr Cys Glu Asn Asp Ala Arg Thr Cys Gly Ser Leu Arg Cys Leu Asn 
1345 1350 1355 1360 

Gly Gly Thr Cys lie Ser Gly Pro Arg Ser Pro Thr Cys Leu Cys Leu 

1365 1370 1375 

Gly Pro Phe Thr Gly Pro Glu Cys Gin Phe Pro Ala Ser Ser Pro Cys 

1380 1385 1390 

Leu Gly Gly Asn Pro Cys Tyr Asn Gin Gly Thr Cys Glu Pro Thr Ser 
1395 1400 1405 

Glu Ser Pro Phe Tyr Arg Cys Leu Cys Pro Ala Lys Phe Asn Gly Leu 
1410 1415 1420 

Leu Cys His lie Leu Asp Tyr Ser Phe Gly Gly Gly Ala Gly Arg Asp 
1425 1430 1435 1440 

lie Pro Pro Pro Leu lie Glu Glu Ala Cys Glu Leu Pro Glu Cys Gin 

1445 1450 1455 

Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn His Ala 

1460 1465 1470 

Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp Pro Trp 
1475 1480 1485 

Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser Asp Gly 
1490 1495 1500 

His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp Gly Phe 
1505 1510 1515 1520 

Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp Gin Tyr 

1525 1530 1535 

Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys Asn Ser 

1540 1545 1550 

Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val Pro Glu 
1555 1560 1565 

Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro Pro Glu 
1570 1575 1580 

Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser Arg Val 
1585 1590 1595 1600 

Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin Gin Met 

1605 1610 1615 

lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His Pro lie 

1620 1625 1630 

Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu Gly Gin 
1635 1640 1645 

Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg Arg Arg 
1650 1655 1660 

Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser lie Val Tyr Leu Glu 
1665 1670 1675 1680 

lie Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe Gin Ser 

1685 1690 1695 
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Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu Gly Ser 

1700 1705 1710 

Leu Asn lie Pro Tyr Lys lie Glu Ala Val Gin Ser Glu Thr Val Glu 
1715 1720 1725 

Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala Ala Ala 
1730 1735 1740 

Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser Arg Lys 
1745 1750 1755 1760 

Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe Lys Val 

1765 1770 1775 

Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Glu Leu Gly Glu Asp Ser 

1780 1785 1790 

Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu Met Asp 
1795 1800 1805 

Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys Lys Phe 
1810 1815 1820 

Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin Thr Asp 
1825 1830 1835 1840 

His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu Arg Met 

1845 1850 1855 

Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala Asp Cys 

1860 1865 1870 

Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu Met lie 
1875 1880 1885 

Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu 
1890 1895 1900 

Glu Asp Ala Pro Ala Val lie Ser Asp Phe lie Tyr Gin Gly Ala Ser 
1905 1910 1915 1920 

Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala 

1925 1930 1935 

Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser 

1940 1945 1950 

Ala Asp Ala Asn lie Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala 
1955 1960 1965 

Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin lie Leu lie Arg Asn 
1970 1975 1980 

Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu 
1985 1990 1995 2000 

lie Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp Leu lie 

2005 2010 2015 

Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala 

2020 2025 2030 

Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val Val Leu 
2035 2040 2045 

Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu Glu Thr 
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2050 2055 2060 

Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val 
2065 2070 2075 2080 

Leu Leu Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met Asp Arg 

2085 2090 2095 

Leu Pro Arg Asp lie Ala Gin Glu Arg Met His His Asp lie Val Arg 

2100 2105 2110 

Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His Gly Ala 
2115 2120 2125 

Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro Asn 
2130 2135 2140 

Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys Val Arg 
2145 2150 2155 2160 

Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp 

2165 2170 2175 

Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu Leu 

2180 2185 2190 

Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His 
2195 2200 2205 

Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser Pro Phe 
2210 2215 2220 

Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met Pro Asp 
2225 2230 2235 2240 

Thr His Leu Gly lie Gly His Leu Asn Val Ala Ala Lys Pro Glu Met 

2245 2250 2255 

Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro Pro 

2260 2265 2270 

Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val Leu Gly 
2275 2280 2285 

Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser Thr Ser 
2290 2295 2300 

Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly Met Val 
2305 2310 2315 2320 

Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly Pro Leu 

2325 2330 2335 

Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro Leu His 

2340 2345 2350 

Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr Gin Gly 
2355 2360 2365 

Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin Thr Gin 
2370 2375 2380 

Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu Gin Pro 
2385 2390 2395 2400 

Ala Asn lie Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro Pro Pro 

2405 2410 2415 




• 
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Gln Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu Gly Arg 

2420 2425 2430 

Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro Leu Gly 
2435 2440 2445 

Pro Ser Ser Leu Ala Val His Thr He Leu Pro Gin Glu Ser Pro Ala 
2450 2455 2460 

Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr Ala Ala 
2465 2470 2475 2480 

Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro Val Glu 

2485 2490 2495 

Asn Thr Pro Ser His Gin Leu Gin Val Pro Glu His Pro Phe Leu Thr 

2500 2505 2510 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser 
2515 2520 2525 

Asn Val Ser Asp Trp Ser Glu Gly Val Ser Ser Pro Pro Thr Ser Met 
2530 2535 2540 

Gin Ser Gin lie Ala Arg He Pro Glu Ala Phe Lys 
2545 2550 2555 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 7419 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGAATTCCG CCC GCC CTG CGC CCC GCT CTG CTG TGG GCG CTG CTG GCG 48 

Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala 
15 10 

CTC TGG CTG TGC TGC GCG GCC CCC GCG CAT GCA TTG CAG TGT CGA GAT 96 
Leu Trp Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gin Cys Arg Asp 
15 20 - 25 

GGC TAT GAA CCC TGT GTA AAT GAA GGA ATG TGT GTT ACC TAC CAC AAT 144 
Gly Tyr Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn 
30 35 40 45 

GGC ACA GGA TAC TGC AAA TGT CCA GAA GGC TTC TTG GGG GAA TAT TGT 192 
Gly Thr Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys 

50 55 60 

CAA CAT CGA GAC CCC TGT GAG AAG AAC CGC TGC CAG AAT GGT GGG ACT 240 
Gin His Arg Asp Pro Cys Glu Lys Asn Arg Cys Gin Asn Gly Gly Thr 

65 70 75 

TGT GTG GCC CAG GCC ATG CTG GGG AAA GCC ACG TGC CGA TGT GCC TCA 288 
Cys Val Ala Gin Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser 
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80 85 90 

GGG TTT ACA GGA GAG GAG TGC CAG TAC TCA ACA TCT CAT CCA TGC TTT 336 
Gly Phe Thr Gly Glu Asp Cys Gin Tyr Ser Thr Ser His Pro Cys Phe 
95 100 105 

GTG TCT CGA CCC TGC CTG AAT GGC GGC ACA TGC CAT ATG CTC AGC CGG 384 
Val Ser Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg 
HO 115 120 125 

GAT ACC TAT GAG TGC ACC TGT CAA GTC GGG TTT ACA GGT AAG GAG TGC 432 
Asp Thr Tyr Glu Cys Thr Cys Gin Val Gly Phe Thr Gly Lys Glu Cys 

130 135 140 

CAA TGG ACG GAT GCC TGC CTG TCT CAT CCC TGT GGA AAT GGA AGT ACC 480 
Gin Trp Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr 

145 150 155 

TGT ACC ACT GTG GCC AAC CAG TTC TCC TGC AAA TGC CTC ACA GGC TTC 528 
Cys Thr Thr Val Ala Asn Gin Phe Ser Cys Lys Cys Leu Thr Gly Phe 
160 165 170 

ACA GGG CAG AAA TGT GAG ACT GAT GTC AAT GAG TGT GAC ATT CCA GGA 576 
Thr Gly Gin Lys Cys Glu Thr Asp Val Asn Glu Cys Asp lie Pro Gly 
175 180 185 

CAC TGC CAG CAT GGT GGC ACC TGC CTC AAC CTG CCT GGT TCC TAC CAG 624 
His Cys Gin His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gin 
190 195 200 205 

TGC CAG TGC CCT CAG GGC TTC ACA GGC CAG TAC TGT GAC AGC CTG TAT 672 
Cys Gin Cys Pro Gin Gly Phe Thr Gly Gin Tyr Cys Asp Ser Leu Tyr 

210 215 220 

GTG CCC TGT GCA CCC TCA CCT TGT GTC AAT GGA GGC ACC TGT CGG CAG 720 
Val Pro Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gin 

225 230 235 

ACT GGT GAC TTC ACT TTT GAG TGC AAC TGC CTT CCA GGT TTT GAA GGG 768 
Thr Gly Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly 
240 245 250 

AGC ACC TGT GAG AGG AAT ATT GAT GAC TGC CCT AAC CAC AGG TGT CAG 816 
Ser Thr Cys Glu Arg Asn lie Asp Asp Cys Pro Asn His Arg Cys Gin 
255 260 265 

AAT GGA GGG GTT TGT GTG GAT GGG GTC AAC ACT TAC AAC TGC CGC TGT 864 
Asn Gly Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys 
270 275 280 285 

CCC CCA CAA TGG ACA GGA CAG TTC TGC ACA GAG GAT GTG GAT GAA TGC 912 
Pro Pro Gin Trp Thr Gly Gin Phe Cys Thr Glu Asp Val Asp Glu Cys 

290 295 300 

CTG CTG CAG CCC AAT GCC TGT CAA AAT GGG GGC ACC TGT GCC AAC CGC 960 
Leu Leu Gin Pro Asn Ala Cys Gin Asn Gly Gly Thr Cys Ala Asn Arg 

305 310 315 

AAT GGA GGC TAT GGC TGT GTA TGT GTC AAC GGC TGG AGT GGA GAT GAC 1008 
Asn Gly Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp 
320 325 330 

TGC AGT GAG AAC ATT GAT GAT TGT GCC TTC GCC TCC TGT ACT CCA GGC 1056 
Cys Ser Glu Asn lie Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly 
335 340 345 

TCC ACC TGC ATC GAC CGT GTG GCC TCC TTC TCT TGC ATG TGC CCA GAG 1104 



37 
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Ser Thr Cys He Asp Arg Val Ala Ser Phe Ser C$m Met cys Pro Glu 
350 355 360 365 



GGG AAG GCA GGT CTC CTG TGT CAT CTG GAT GAT GCa TGC 2HC AGC AAT 
Gly Lys Ala Gly Leu Leu Cys His Leu Asp Asp AILa Cys lie Ser Asn 

370 375 380 



1152 



CCT TGC CAC AAG GGG GCA CTG TGT GAC ACC AAC CCTT CTA AAT GGG CAA 
Pro Cys His Lys Gly Ala Leu Cys Asp Thr Asn Piot> Leu £sn Gly Gin 

385 390 395 



1200 



TAT ATT TGC ACC TGC CCA CAA GGC TAC AAA GGG GC3T GAC TCC ACA GAA 
Tyr He Cys Thr Cys Pro Gin Gly Tyr Lys Gly Aim Asp Cys Thr Glu 
400 405 410 



1248 



GAT GTG GAT GAA TGT GCC ATG GCC AAT AGC AAT 
Asp Val Asp Glu Cys Ala Met: Ala Asn Ser Asn 
415 420 



TGT GAG CAT GCA 
Cys Glu His Ala 



1296 



GGA AAA TGT GTG AAC ACG GAT GGC GCC TTC CAC TOT GAG TGT CTG AAG 
Gly Lys Cys Val Asn Thr Asp Gly Ala Phe His Cy® Glu Cys Leu Lys 
430 435 440 445 



1344 



GGT TAT GCA GGA CCT CGT TGT GAG ATG GAC ATC AMT GAG TGC CAT TCA 
Gly Tyr Ala Gly Pro Arg Cys Glu Met Asp He Asm Glu Cys His Ser 

450 455 460 



1392 



GAC CCC TGC CAG AAT GAT GCT ACC TGT CTG GAT 
Asp Pro Cys Gin Asn Asp Ala Thr Cys Leu Asp 

465 470 



ATT GGA GGC TTC 
He Gly Gly Phe 
475 



1440 



ACA TGT CTG TGC ATG CCA GGT TTC AAA GGT GTG CM TGT GAA TTA GAA 
Thr Cys Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu 
480 485 490 



1488 



ATA AAT GAA TGT CAG AGC AAC CCT TGT GTG AAC 
He Asn Glu Cys Gin Ser Asn Pro Cys Val Asn 
495 500 



GGG CAG TGT GTG 
Gly Gin Cys Val 



1536 



GAT AAA GTC AAT CGT TTC CAG TGC CTG TGT CCT 
Asp Lys Val Asn Arg Phe Gin Cys Leu Cys Pro 
510 515 520 



GGT TTC ACT GGG 
Gly Phe Thr Gly 

525 



1584 



CCA GTT TGC CAG ATT GAT ATT GAT GAC TGT TCC AGT ACT CCG TGT CTG 
Pro Val Cys Gin He Asp He Asp Asp Cys Ser Ser- Thr Pro Cys Leu 

530 535 540 



1632 



AAT GGG GCA AAG TGT ATC GAT CAC CCG AAT GGC TAT GAA TGC CAG TGT 
Asn Gly Ala Lys Cys He Asp His Pro Asn Gly Tyr Glu Cys Gin Cys 

545 550 555 



1680 



GCC ACA GGT TTC ACT GGT GTG TTG TGT GAG GAG AAC ATT GAC AAC TGT 
Ala Thr Gly Phe Thr Gly Val Leu Cys Glu Glu Asn He Asp Asn Cys 
560 565 570 



1728 



GAC CCC GAT CCT TGC CAC CAT GGT CAG TGT CAG GAT GGT ATT GAT TCC 
Asp Pro Asp Pro Cys His His Gly Gin Cys Gin Asp Gly He Asp Ser 
575 580 585 



1776 



TAC ACC TGC ATC TGC AAT CCC GGG TAC ATG GGC GCC ATC TGC AGT GAC 
Tyr Thr Cys He Cys Asn Pro Gly Tyr Met Gly Ala He Cys Ser Asp 
590 595 600 605 



1824 



CAG ATT GAT GAA TGT TAC AGC AGC CCT TGC CTG AAC GAT GGT CGC TGC 
Gin He Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys 

610 615 620 



1872 
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ATT GAC CTG GTC AAT GGC TAC GAG TGC AAC TGC C3US CCA GGC ACG TCA 1920 
lie Asp Leu Val Asn Gly Tyr Gin Cys Asn Cys Glna Pro Gly Thr Ser 

625 630 635 

GGG GTT AAT TGT GAA ATT AAT TTT GAT GAC TGT GC&. AGT AAC CCT TGT 1968 
Gly Val Asn Cys Glu lie Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys 
640 645 650 

ATC CAT GGA ATC TGT ATG GAT GGC ATT AAT CGC TAC AGT TCT GTC TGC 2016 
lie His Gly lie Cys Met Asp Gly lie Asn Arg Tyar Ser Cys Val Cys 
655 660 



TCA CCA GGA TTC ACA GGG CAG AGA TGT AAC ATT GAC ATT GAT GAG TGT 2064 
Ser Pro Gly Phe Thr Gly Gin Arg Cys Asn lie Asp lie Asp Glu Cys 
670 675 680 685 

GCC TCC AAT CCC TGT CGC AAG GGT GGA ACA TGT ATC AAC GGT GTG AAT 2112 
Ala Ser Asn Pro Cys Arg Lys Gly Ala Thr Cys He Asn Gly Val Asn 

690 695 700 

GGT TTC CGC TGT ATA TGC CCC GAG GGA CCC CAT CAC CCC AGC TGC TAC 2160 
Gly Phe Arg Cys He Cys Pro Glu Gly Pro His Hie Pro Ser Cys Tyr 

705 710 715 

TCA CAG GTG AAC GAA TGC CTG AGC AAT CCC TGC ATC CAT GGA AAC TGT 2208 
Ser Gin Val Asn Glu Cys Leu Ser Asn Pro Cys lie His Gly Asn Cys 
720 725 730 

ACT GGA GGT CTC AGT GGA TAT AAG TGT CTC TGT GAT GCA GGC TGG GTT 2256 
Thr Gly Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val 
735 740 



GGC ATC AAC TGT GAA GTG GAC AAA AAT GAA TGC CTT TCG AAT CCA TGC 2304 
Gly lie Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys 
750 755 760 765 

CAG AAT GGA GGA ACT TGT GAC AAT CTG GTG AAT GGA TAC AGG TGT ACT 2352 
Gin Asn Gly Gly Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr 

770 775 780 

TGC AAG AAG GGC TTT AAA GGC TAT AAC TGC CAG GTG AAT ATT GAT GAA 2400 
Cys Lys Lys Gly Phe Lys Gly Tyr Asn Cys Gin Val Asn lie Asp Glu 

785 790 795 

TGT GCC TCA AAT CCA TGC CTG AAC CAA GGA ACC TGC TTT GAT GAC ATA 2448 
Cys Ala Ser Asn Pro Cys Leu Asn Gin Gly Thr Cys Phe Asp Asp lie 
800 805 810 

AGT GGC TAC ACT TGC CAC TGT GTG CTG CCA TAC ACA GGC AAG AAT TGT 2496 
Ser Gly Tyr Thr Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys 
815 820 825 

CAG ACA GTA TTG GCT CCC TGT TCC CCA AAC CCT TGT GAG AAT GCT GCT 2544 
Gin Thr Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala 
830 835 840 845 

GTT TGC AAA GAG TCA CCA AAT TTT GAG AGT TAT ACT TGC TTG TGT GCT 2592 
Val Cys Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala 

850 855 860 

CCT GGC TGG CAA GGT CAG CGG TGT ACC ATT GAC ATT GAC GAG TGT ATC 2640 
Pro Gly Trp Gin Gly Gin Arg Cys Thr lie Asp He Asp Glu Cys lie 

865 870 875 

TCC AAG CCC TGC ATG AAC CAT GGT CTC TGC CAT AAC ACC CAG GGC AGC 2688 
Ser Lys Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gin Gly Ser 
880 885 890 
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TAC ATG TGT GAA TGT CCA CCA GGC TTC AGT GGT ATG GAC TGT GAG GAG 2736 
Tyr Met Cys Glu Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu 
895 900 905 

GAC ATT GAT GAC TGC CTT GCC AAT CCT TGC CAG AAT GGA GGT TCC TGT 2784 
Asp lie Asp Asp Cys Leu Ala Asn Pro Cys Gin Asn Gly Gly Ser Cys 
910 915 920 925 

ATG GAT GGA GTG AAT ACT TTC TCC TGC CTC TGC CTT CCG GGT TTC ACT 2832 
Met Asp Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr 

930 935 940 

GGG GAT AAG TGC CAG ACA GAC ATG AAT GAG TGT CTG AGT GAA CCC TGT 2880 
Gly Asp Lys Cys Gin Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys 

945 950 955 

AAG AAT GGA GGG ACC TGC TCT GAC TAC GTC AAC AGT TAC ACT TGC AAG 2928 
Lys Asn Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys 
960 965 970 

TGC CAG GCA GGA TTT GAT GGA GTC CAT TGT GAG AAC AAC ATC AAT GAG 2976 
Cys Gin Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn lie Asn Glu 
975 980 985 

TGC ACT GAG AGC TCC TGT TTC AAT GGT GGC ACA TGT GTT GAT GGG ATT 3024 
Cys Thr Glu Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly lie 
990 995 1000 1005 

AAC TCC TTC TCT TGC TTG TGC CCT GTG GGT TTC ACT GGA TCC TTC TGC 3072 
Asn Ser Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys 

1010 1015 1020 

CTC CAT GAG ATC AAT GAA TGC AGC TCT CAT CCA TGC CTG AAT GAG GGA 3120 
Leu His Glu lie Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly 

1025 1030 1035 

ACG TGT GTT GAT GGC CTG GGT ACC TAC CGC TGC AGC TGC CCC CTG GGC 3168 
Thr Cys Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu Gly 
1040 1045 1050 

TAC ACT GGG AAA AAC TGT CAG ACC CTG GTG AAT CTC TGC AGT CGG TCT 3216 
Tyr Thr Gly Lys Asn Cys Gin Thr Leu Val Asn Leu Cys Ser Arg Ser 
1055 1060 1065 

CCA TGT AAA AAC AAA GGT ACT TGT GTT CAG AAA AAA GCA GAG TCC CAG 3264 
Pro Cys Lys Asn Lys Gly Thr Cys Val Gin Lys Lys Ala Glu Ser Gin 
1070 1075 1080 1085 

TGC CTA TGT CCA TCT GGA TGG GCT GGT GCC TAT TGT GAC GTG CCC AAT 3312 
Cys Leu Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys Asp Val Pro Asn 

1090 1095 1100 

GTC TCT TGT GAC ATA GCA GCC TCC AGG AGA GGT GTG CTT GTT GAA CAC 3360 
Val Ser Cys Asp lie Ala Ala Ser Arg Arg Gly Val Leu Val Glu His 

1105 1110 1115 

TTG TGC CAG CAC TCA GGT GTC TGC ATC AAT GCT GGC AAC ACG CAT TAC . 3408 
Leu Cys Gin His Ser Gly Val Cys lie Asn Ala Gly Asn Thr His Tyr 
1120 1125 1130 

TGT CAG TGC CCC CTG GGC TAT ACT GGG AGC TAC TGT GAG GAG CAA CTC 3456 
Cys Gin Cys Pro Leu Gly Tyr Thr Gly Ser Tyr Cys Glu Glu Gin Leu 
1135 1140 1145 

GAT GAG TGT GCG TCC AAC CCC TGC CAG CAC GGG GCA ACA TGC AGT GAC 3504 
Asp Glu Cys Ala Ser Asn Pro Cys Gin His Gly Ala Thr Cys Ser Asp 
1150 1155 1160 1165 
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TTC ATT GGT GGA TAC AGA TGC GAG TGT GTC CCA GGC TAT CAG GGT GTC 3552 

Phe lie Gly Gly Tyr Arg Cya Glu Cys Val Pro Gly Tyr Gin Gly Val 

1170 1175 1180 

AAC TGT GAG TAT GAA GTG GAT GAG TGC CAG AAT CAG CCC TGC CAG AAT 3600 
Asn Cys Glu Tyr Glu Val Asp Glu Cys Gin Asn Gin Pro Cys Gin Asn 

1185 1190 1195 

GGA GGC ACC TGT ATT GAC CTT GTG AAC CAT TTC AAG TGC TCT TGC CCA 3648 
Gly Gly Thr Cys lie Asp Leu Val Asn His Phe Lys Cys Ser Cys Pro 
1200 1205 1210 

CCA GGC ACT CGG GGC CTA CTC TGT GAA GAG AAC ATT GAT GAC TGT GCC 3696 
Pro Gly Thr Arg Gly Leu Leu Cys Glu Glu Asn lie Asp Asp Cys Ala 
1215 1220 1225 

CGG GGT CCC CAT TGC CTT AAT GGT GGT CAG TGC ATG GAT AGG ATT GGA 3744 
Arg Gly Pro His Cys Leu Asn Gly Gly Gin Cys Met Asp Arg lie Gly 
1230 1235 1240 1245 

GGC TAC AGT TGT CGC TGC TTG CCT GGC TTT GCT GGG GAG CGT TGT GAG 3792 
Gly Tyr Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu 

1250 1255 1260 

GGA GAC ATC AAC GAG TGC CTC TCC AAC CCC TGC AGC TCT GAG GGC AGC 3840 
Gly Asp lie Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Glu Gly Ser 

1265 1270 1275 

CTG GAC TGT ATA CAG CTC ACC AAT GAC TAC CTG TGT GTT TGC CGT AGT 3888 
Leu Asp Cys lie Gin Leu Thr Asn Asp Tyr Leu Cys Val Cys Arg Ser 
1280 1285 1290 

GCC TTT ACT GGC CGG CAC TGT GAA ACC TTC GTC GAT GTG TGT CCC CAG 3936 
Ala Phe Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys Pro Gin 
1295 1300 1305 

ATG CCC TGC CTG AAT GGA GGG ACT TGT GCT GTG GCC AGT AAC ATG CCT 3984 
Met Pro Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser Asn Met Pro 
1310 1315 1320 1325 

GAT GGT TTC ATT TGC CGT TGT CCC CCG GGA TTT TCC GGG GCA AGG TGC 4032 
Asp Gly Phe lie Cys Arg Cys Pro Pro Gly Phe Ser Gly Ala Arg Cys 

1330 1335 1340 

CAG AGC AGC TGT GGA CAA GTG AAA TGT AGG AAG GGG GAG CAG TGT GTG 4080 
Gin Ser Ser Cys Gly Gin Val Lys Cys Arg Lys Gly Glu Gin Cys Val 

1345 1350 1355 

CAC ACC GCC TCT GGA CCC CGC TGC TTC TGC CCC AGT CCC CGG GAC TGC 4128 
His Thr Ala Ser Gly Pro Arg Cys Phe Cys Pro Ser Pro Arg Asp Cys 
1360 1365 1370 

GAG TCA GGC TGT GCC AGT AGC CCC TGC CAG CAC GGG GGC AGC TGC CAC 4176 
Glu Ser Gly Cys Ala Ser Ser Pro Cys Gin His Gly Gly Ser Cys His 
1375 1380 1385 

CCT CAG CGC CAG CCT CCT TAT TAC TCC TGC CAG TGT GCC CCA CCA TTC 4224 
Pro Gin Arg Gin Pro Pro Tyr Tyr Ser Cys Gin Cys Ala Pro Pro Phe 
1390 1395 1400 1405 

TCG GGT AGC CGC TGT GAA CTC TAC ACG GCA CCC CCC AGC ACC CCT CCT 4272 
Ser Gly Ser Arg Cys Glu Leu Tyr Thr Ala Pro Pro Ser Thr Pro Pro 

1410 1415 1420 

GCC ACC TGT CTG AGC CAG TAT TGT GCC GAC AAA GCT CGG GAT GGC GTC 4320 
Ala Thr Cys Leu Ser Gin Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val 

1425 1430 1435 
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TGT GAT GAG GCC TGC AAC AGC CAT GCC TGC CAG TGG GAT GGG GGT GAC 
Cys Asp Glu Ala Cya Asn Ser His Ala Cys Gin Trp Asp Gly Gly Asp 
1440 1445 1450 

TGT TCT CTC ACC ATG GAG AAC CCC TGG GCC AAC TGC TCC TCC CCA CTT 
Cys Ser Leu Thr Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu 
1455 1460 1465 

CCC T<JC TGG GAT TAT ATC AAC AAC CAG TGT GAT GAG CTG TGC AAC ACG 
Pro Cys Trp Asp Tyr lie Asn Asn Gin Cys Asp Glu Leu Cys Asn Thr 
1470 1475 1480 1485 

GTC GAG TGC CTG TTT GAC AAC TTT GAA TGC CAG GGG AAC AGC AAG ACA 
Val Glu Cys Leu Phe Asp Asn Phe Glu Cys Gin Gly Asn Ser Lys Thr 

1490 1495 1500 

TGC AAG TAT GAC AAA TAC TGT GCA GAC CAC TTC AAA GAC AAC CAC TGT 
Cys Lys Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys 

1505 1510 1515 

AAC CAG GGG TGC AAC AGT GAG GAG TGT GGT TGG GAT GGG CTG GAC TGT 
Asn Gin Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys 
1520 1525 1530 

OCT GCT GAC CAA CCT GAG AAC CTG GCA GAA GGT ACC CTG GTT ATT GTG 
Ala Ala Asp Gin Pro Glu Asn Leu Ala Glu Gly Thr Leu Val lie Val 
1535 1540 1545 

GTA TTG ATG CCA CCT GAA CAA CTG CTC CAG GAT GCT CGC AGC TTC TTG 
Val Leu Met Pro Pro Glu Gin Leu Leu Gin Asp Ala Arg Ser Phe Leu 
1550 1555 1560 1565 

CGG GCA CTG GGT ACC CTG CTC CAC ACC AAC CTG CGC ATT AAG CGG GAC 
Arg Ala Leu Gly Thr Leu Leu His Thr Asn Leu Arg lie Lys Arg Asp 

1570 1575 1580 

TCC CAG GGG GAA CTC ATG GTG TAC CCC TAT TAT GGT GAG AAG TCA GCT 
Ser Gin Gly Glu Leu Met Val Tyr Pro Tyr Tyr Gly Glu Lys Ser Ala 

1585 1590 1595 

GCT ATG AAG AAA CAG AGG ATG ACA CGC AGA TCC CTT CCT GGT GAA CAA 
Ala Met Lys Lys Gin Arg Met Thr Arg Arg Ser Leu Pro Gly Glu Gin 
1600 1605 1610 

GAA CAG GAG GTG GCT GGC TCT AAA GTC TTT CTG GAA ATT GAC AAC CGC 
Glu Gin Glu Val Ala Gly Ser Lys Val Phe Leu Glu lie Asp Asn Arg 
1615 1620 1625 

CAG TGT GTT CAA GAC TCA GAC CAC TGC TTC AAG AAC ACG GAT GCA GCA 
Gin Cys Val Gin Asp Ser Asp His Cys Phe Lys Asn Thr Asp Ala Ala 
1630 1635 1640 1645 

GCA GCT CTC CTG GCC TCT CAC GCC ATA CAG GGG ACC CTG TCA TAC CCT 
Ala Ala Leu Leu Ala Ser His Ala lie Gin Gly Thr Leu Ser Tyr Pro 

1650 1655 1660 

CTT GTG TCT GTC GTC AGT GAA TCC CTG ACT CCA GAA CGC ACT CAG* CTC 
Leu Val Ser Val Val Ser Glu Ser Leu Thr Pro Glu Arg Thr Gin Leu 

1665 1670 1675 

CTC TAT CTC CTT GCT GTT GCT GTT GTC ATC ATT CTG TTT ATT ATT CTG 
Leu Tyr Leu Leu Ala Val Ala Val Val lie lie Leu Phe lie lie Leu 
1680 1685 1690 

CTG GGG GTA ATC ATG GCA AAA CGA AAG CGT AAG CAT GGC TCT CTC TGG 
Leu Gly Val lie Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp 
1695 1700 1705 



4368 



4416 



4464 



4512 



4560 



4608 



4656 



4704 



4752 



4800 



4848 



4896 



4944 



4992 



5040 



5088 



5136 
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CTG CCT GAA GGT TTC ACT CTT CGC CGA GAT GCA AGC AAT CAC AAG CGT 5184 
Leu Pro Glu Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg 
1710 1715 1720 1725 

CGT GAG CCA GTG GGA CAG GAT GCT GTG GGG CTG AAA AAT CTC TCA GTG 5232 
Arg Glu Pro Val Gly Gin Asp Ala Val Gly Leu Lys Asn Leu Ser Val 

1730 1735 1740 

CAA GTC TCA GAA GCT AAC CTA ATT GGT ACT GGA ACA AGT GAA CAC TGG 5280 
Gin Val Ser Glu Ala Asn Leu lie Gly Thr Gly Thr Ser Glu His Trp 

1745 1750 1755 

GTC GAT GAT GAA GGG CCC CAG CCA AAG AAA GTA AAG GCT GAA GAT GAG 5328 
Val Asp Asp Glu Gly Pro Gin Pro Lys Lys Val Lye Ala Glu Asp Glu 
1760 1765 1770 

GCC TTA CTC TCA GAA GAA GAT GAC CCC ATT GAT CGA CGG CCA TGG ACA 5376 
Ala Leu Leu Ser Glu Glu Asp Asp Pro lie Asp Arg Arg Pro Trp Thr 
1775 1780 1785 

CAG CAG CAC CTT GAA GCT GCA GAC ATC CGT AGG ACA CCA TCG CTG GCT 5424 
Gin Gin His Leu Glu Ala Ala Asp lie Arg Arg Thr Pro Ser Leu Ala 
1790 1795 1800 1805 

CTC ACC CCT CCT CAG GCA GAG CAG GAG GTG GAT GTG TTA GAT GTG AAT 5472 
Leu Thr Pro Pro Gin Ala Glu Gin Glu Val Asp Val Leu Asp Val Asn 

1810 1815 1820 

GTC CGT GGC CCA GAT GGC TGC ACC CCA TTG ATG TTG GCT TCT CTC CGA 5520 
Val Arg Gly Pro Asp Gly Cys Thr Pro Leu Met Leu Ala Ser Leu Arg 

1825 1830 1835 

GGA GGC AGC TCA GAT TTG AGT GAT GAA GAT GAA GAT GCA GAG GAC TCT 5568 
Gly Gly Ser Ser Asp Leu Ser Asp Glu Asp Glu Asp Ala Glu Asp Ser 
1840 1845 1850 

TCT GCT AAC ATC ATC ACA GAC TTG GTC TAC CAG GGT GCC AGC CTC CAG 5616 
Ser Ala Asn He He Thr Asp Leu Val Tyr Gin Gly Ala Ser Leu Gin 
1855 1860 1865 

GCC CAG ACA GAC CGG ACT GGT GAG ATG GCC CTG CAC CTT GCA GCC CGC 5664 
Ala Gin Thr Asp Arg Thr Gly Glu Met Ala Leu His Leu Ala Ala Arg 
1870 1875 1880 1885 

TAC TCA CGG GCT GAT GCT GCC AAG CGT CTC CTG GAT GCA GGT GCA GAT 5712 
Tyr Ser Arg Ala Asp Ala Ala Lys Arg Leu Leu Asp Ala Gly Ala Asp 

1890 1895 1900 

GCC AAT GCC CAG GAC AAC ATG GGC CGC TGT CCA CTC CAT GCT GCA GTG 5 760 

Ala Asn Ala Gin Asp Asn Met Gly Arg Cys Pro Leu His Ala Ala Val 

1905 1910 1915 

GCA GCT GAT GCC CAA GGT GTC TTC CAG ATT CTG ATT CGC AAC CGA GTA 5808 
Ala Ala Asp Ala Gin Gly Val Phe Gin He Leu He Arg Asn Arg Val 
1920 1925 1930 

ACT GAT CTA GAT GCC AGG ATG AAT GAT GGT ACT ACA CCC CTG ATC CTG 5856 
Thr Asp Leu Asp Ala Arg Met Asn Asp Gly Thr Thr Pro Leu He Leu 
1935 1940 1945 

GCT GCC CGC CTG GCT GTG GAG GGA ATG GTG GCA GAA CTG ATC AAC TGC 5904 
Ala Ala Arg Leu Ala Val Glu Gly Met Val Ala Glu Leu He Asn Cys 
1950 1955 1960 1965 

CAA GCG GAT GTG AAT GCA GTG GAT GAC CAT GGA AAA TCT GCT CTT CAC 5952 
Gin Ala Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His 

1970 1975 1980 
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TGG GCA GCT GCT GTC AAT AAT GTG GAG GCA ACT CTT TTG TTG TTG AAA 
Trp Ala Ala Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys 

1985 1990 1995 

AAT GGG GCC AAC CGA GAC ATG CAG GAC AAC AAG GAA GAG ACA CCT CTG 
Asn Gly Ala Asn Arg Asp Met Gin Asp Asn Lys Glu Glu Thr Pro Leu 
2000 2005 2010 

TTT CTT GCT GCC CGG GAG GGG AGC TAT GAA GCA GCC AAG ATC CTG TTA 
Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys lie Leu Leu 
2015 2020 2025 

GAC CAT TTT GCC AAT CGA GAC ATC ACA GAC CAT ATG GAT CGT CTT CCC 
Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro 
2030 2035 2040 2045 

CGG GAT GTG GCT CGG GAT CGC ATG CAC CAT GAC ATT GTG CGC CTT CTG 
Arg Asp Val Ala Arg Asp Arg Met His His Asp He Val Arg Leu Leu 

2050 2055 2060 

GAT GAA TAC AAT GTG ACC CCA AGC CCT CCA GGC ACC GTG TTG ACT TCT 
Asp Glu Tyr Asn Val Thr Pro Ser Pro Pro Gly Thr Val Leu Thr Ser 

2065 2070 2075 

GCT CTC TCA CCT GTC ATC TGT GGG CCC AAC AGA TCT TTC CTC AGC CTG 
Ala Leu Ser Pro Val He Cys Gly Pro Asn Arg Ser Phe Leu Ser Leu 
2080 2085 2090 

AAG CAC ACC CCA ATG GGC AAG AAG TCT AGA CGG CCC AGT GCC AAG AGT 
Lys His Thr Pro Met Gly Lys Lys Ser Arg Arg Pro Ser Ala Lys Ser 
2095 2100 2105 

ACC ATG CCT ACT AGC CTC CCT AAC CTT GCC AAG GAG GCA AAG GAT GCC 
Thr Met Pro Thr Ser Leu Pro Asn Leu Ala Lys Glu Ala Lys Asp Ala 
2110 2115 2120 2125 

AAG GGT AGT AGG AGG AAG AAG TCT CTG AGT GAG AAG GTC CAA CTG TCT 
Lys Gly Ser Arg Arg Lys Lys Ser Leu Ser Glu Lys Val Gin Leu Ser 

2130 2135 2140 

GAG AGT TCA GTA ACT TTA TCC CCT GTT GAT TCC CTA GAA TCT CCT CAC 
Glu Ser Ser Val Thr Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His 

2145 2150 2155 

ACG TAT GTT TCC GAC ACC ACA TCC TCT CCA ATG ATT ACA TCC CCT GGG 
Thr Tyr Val Ser Asp Thr Thr Ser Ser Pro Met He Thr Ser Pro Gly 
2160 . 2165 2170 

ATC TTA CAG GCC TCA CCC AAC CCT ATG TTG GCC ACT GCC GCC CCT CCT 
lie Leu Gin Ala Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro 
2175 2180 2185 

GCC CCA GTC CAT GCC CAG CAT GCA CTA TCT TTT TCT AAC CTT CAT GAA 
Ala Pro Val His Ala Gin His Ala Leu Ser Phe Ser Asn Leu His Glu 
2190 2195 2200 2205 

ATG CAG CCT TTG GCA CAT GGG GCC AGC ACT GTG CTT CCC TCA GTG AGC 
Met Gin Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser 

2210 2215 2220 

CAG TTG CTA TCC CAC CAC CAC ATT GTG TCT CCA GGC AGT GGC AGT GCT 
Gin Leu Leu Ser His His His He Val Ser Pro Gly Ser Gly Ser Ala 

2225 2230 2235 

GGA AGC TTG AGT AGG CTC CAT CCA GTC CCA GTC CCA GCA GAT TGG ATG 
Gly Ser Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp Met 
2240 2245 2250 



6000 



6048 



6096 



6144 



6192 



6240 



6288 



6336 



6384 



6432 



6480 



6528 



6576 



6624 



6672 



6720 



6768 
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AAC CGC ATG GAG GTG AAT GAG ACC CAG TAC AAT GAG ATG TTT GGT ATG 
Asn Arg Met Glu Val Asn Glu Thr Gin Tyr Asn Glu Met Phe Gly Met 
2255 2260 2265 



6816 



GTC CTG GCT CCA GCT GAG GGC ACC CAT CCT GGC ATA GCT CCC CAG AGC 
Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly lie Ala Pro Gin Ser 
2270 2275 2280 2285 



6864 



AGG CCA CCT GAA GGG AAG CAC ATA ACC ACC CCT CGG GAG CCC TTG CCC 
Arg Pro Pro Glu Gly Lys His lie Thr Thr Pro Arg Glu Pro Leu Pro 

2290 2295 2300 



6912 



CCC ATT GTG ACT TTC CAG CTC ATC CCT AAA GGC AGT ATT GCC CAA CCA 
Pro lie Val Thr Phe Gin Leu lie Pro Lys Gly Ser lie Ala Gin Pro 

2305 2310 2315 



6960 



GCG GGG GCT CCC CAG CCT CAG TCC ACC TGC CCT CCA GCT GTT GCG GGC 
Ala Gly Ala Pro Gin Pro Gin Ser Thr Cys Pro Pro Ala Val Ala Gly 
2320 2325 2330 



7008 



CCC CTG CCC ACC ATG TAC CAG ATT CCA GAA ATG GCC CGT TTG CCC AGT 

Pro Leu Pro Thr Met Tyr Gin lie Pro Glu Met Ala Arg Leu Pro Ser 
2335 2340 2345 



7056 



GTG GCT TTC CCC ACT GCC ATG ATG CCC CAG CAG GAC GGG CAG GTA GCT 
Val Ala Phe Pro Thr Ala Met Met Pro Gin Gin Asp Gly Gin Val Ala 
2350 2355 2360 2365 



7104 



CAG ACC ATT CTC CCA GCC TAT CAT CCT TTC CCA GCC TCT GTG GGC AAG 
Gin Thr lie Leu Pro Ala Tyr His Pro Phe Pro Ala Ser Val Gly Lys 

2370 2375 2380 



7152 



TAC CCC ACA CCC CCT TCA CAG CAC AGT TAT GCT TCC TCA AAT GCT GCT 
Tyr Pro Thr Pro Pro Ser Gin His Ser Tyr Ala Ser Ser Asn Ala Ala 

2385 2390 2395 



7200 



GAG CGA ACA CCC AGT CAC AGT GGT CAC CTC CAG GGT GAG CAT CCC TAC 
Glu Arg Thr Pro Ser His Ser Gly His Leu Gin Gly Glu His Pro Tyr 
2400 2405 2410 



7248 



CTG ACA CCA TCC CCA GAG TCT CCT GAC CAG TGG TCA AGT TCA TCA CCC 
Leu Thr Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro 
2415 2420 2425 



7296 



CAC TCT GCT TCT GAC TGG TCA GAT GTG ACC ACC AGC CCT ACC CCT GGG 
His Ser Ala Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly 
2430 2435 - 2440 2445 



7344 



GGT GCT GGA GGA GGT CAG CGG GGA CCT GGG ACA CAC ATG TCT GAG CCA 
Gly Ala Gly Gly Gly Gin Arg Gly Pro Gly Thr His Met Ser Glu Pro 

2450 2455 2460 



7392 



CCA CAC AAC AAC ATG CAG GTT TAT GCG TGAGAGAGTC CACCTCCAGT 
Pro His Asn Asn Met Gin Val Tyr Ala 

2465 2470 



7439 



GTAGAGACAT AACTGACTTT TGTAAATGCT GCTGAGGAAC AAATGAAGGT CATCCGGGAG 
AGAAATGAAG AAATCTCTGG AG CC AG CT TC TAGAGGTAGG AAAGAGAAGA TGTTCTTATT 
CAGATAATGC AAGAGAAGCA ATTCGTCAGT TTCACTGGGT ATCTGCAAGG CTTATTGATT 
AT TCT AATCT • AATAAGACAA GTTTGTGGAA ATG C AAG ATG AATACAAGCC TTGGGTCCAT 
GTTTACTCTC TTCTATTTGG AG AAT AAG AT GGATGCTTAT TGAAGCCCAG ACATTCTTGC 
AGCTTGGACT GCATTTTAAG CCCTGCAGGC TTCTGCCATA TCCATGAGAA GATTCTACAC 



7499 
7559 
7619 
7679 
7739 
7799 
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TAGCGTCCTG 


TTGGGAATTA 


TGCCCTGGAA 


TTCTGCCTGA 


ATTGACCTAC 


GCATCTCCTC 


7*859 


CTCCTTGGAC 


ATTCTTTTGT 


CTTCATTTGG 


TGCTTTTGGT 


TTTGCACCTC 


TCCGTGATTG 


7919 


TAGCCCTACC 


AGCATGTTAT 


AGGGCAAGAC 


CTTTGTGCTT 


TTGATCATTC 


TGGCCCATGA . ^ 

V . . 


7979 


AAGCAACTTT 


GGTCTCCTTT 


CCCCTCCTGT 


CTTCCCGGTA 


TCCCTTGGAG 


TCTCACAAGG 


8039 


TTTACXTTGG 


TATGGTTCTC 


AGCACAAACC 


TTTCAAGTAT 


GTTGTTTCTT 


TGGAAAATGG 


8099 


ACATACTGTA 


TTGTGTTCTC 


CTGCATATAT 


CATTCCTGGA 


GAGAGAAGGG 


GAGAAGAATA 


8159 


CTTTTCTTCA 


ACAAATTTTG 


GGGGCAGGAG 


ATCCCTTCAA 


GAGGCTGCAC 


CTTAATTTTT 


8219 


CTTGTCTGTG 


TGCAGGTCTT 


CATATAAACT 


TTACCAGGAA 


GAAGGGTGTG 


AGTTTGTTGT 


8279 


TTTTCTGTGT 


ATGGGCCTGG 


TCAGTGTAAA 


GTTTTATCCT 


TGATAGTCTA 


GTTACTATGA 


8339 


CCCTCCCCAC 


TTTTTTAAAA 


CCAGAAAAAG 


GTTTGGAATG 


TTGGAATGAC 


CAAGAGACAA 


8399 


GTTAACTCGT 


GCAAGAGCCA 


GTTACCCACC 


CACAGGTCCC 


CCTACTTCCT 


GCCAAGCATT 


8459 


CCATTGACTG 


CCTGTATGGA 


ACACATTTGT 


CCCAGATCTG 


AGCATTCTAG 


GCCTGTTTCA 


8519 


CTCACTCACC 


CAGCATATGA 


AACTAGTCTT 


AACTGTTGAG 


CCTTTCCTTT 


CATATCCACA 


8579 


GAAGACACTG 


TCTCAAATGT 


TGTACCCTTG 


CCATTTAGGA 


CTGAACTTTC 


CTTAGCCCAA 


8639 


GGGACCCAGT 


GACAGTTGTC 


TTCCGTTTGT 


CAGATGATCA 


GTCTCTACTG 


ATTATCTTGC 


8699 


TGCTTAAAGG 


CCTGCTCACC 


AATCTTTCTT 


TCACACCGTG 


TGGTCCGTGT 


TACTGGTATA 


8759 


CCCAGTATGT 


TCTCACTGAA 


GACATGGACT 


TTATATGTTC 


AAGTGCAGGA 


ATTGGAAAGT 


8819 


TGGACTTGTT 


TTCTATGATC 


CAAAAC AG CC 


CTATAAGAAG 


GTTGGAAAAG 


GAGGAACTAT 


8879 


ATAGCAGCCT 


TTGCTATTTT 


CTGCTACCAT 


TTCTTTTCCT 


CTGAAGCGGC 


CATGACATTC 


8939 


CCTTTGGCAA 


CTAACGTAGA 


AACTCAACAG 


AACATTTTCC 


TTTCCTAGAG 


TCACCTTTTA 


8999 


GATGATAATG 


GACAACTATA 


GACTTGCTCA 


TTGTTCAGAC 


TGATTGCCCC 


TCACCTGAAT 


9059 


CCACTCTCTG 


TATTCATGCT 


CTTGGCAATT 


TCTTTGACTT 


TCTTTTAAGG 


GCAGAAGCAT 


9119 


TTTAGTTAAT 


TG TAG AT AAA 


GAATAGTTTT 


CTTCCTCTTC 


TCCTTGGGCC 


AGTTAATAAT 


9179 


TGGTCCATGG 


CTACACTGCA 


ACTTCCGTCC 


AGTGCTGTGA 


TGCCCATGAC 


ACCTGCAAAA 


9239 


TAAGTTCTGC 


CTGGGCATTT 


TGTAGATATT 


AACAGGTGAA 


TTCCCGACTC 


TTTTGGTTTG 


9299 


AATGACAGTT 


CTCATTCCTT 


CTATGGCTGC 


AAGT ATG CAT 


CAGTGCTTCC 


CACTTACCTG 


9359 


ATTTGTCTGT 


CGGTGGCCCC 


ATATGGAAAC 


CCTGCGTGTC 


TGTTGGCATA 


ATAGTTTACA 


9419 


AATGGTTTTT 


TCAGTCCTAT 


CCAAATTTAT 


TGAACCAACA 


AAAATAATTA 


CTTCTGCCCT 


9479 


G AG AT AAG C A 


GATTAAGTTT 


GTTCATTCTC 


TGCTTTATTC 


TCTCCATGTG 


GCAACATTCT 


9539 


GTCAGCCTCT 


TTCATAGTGT 


GCAAACATTT 


TATCATTCTA 


AATGGTGACT 


CTCTGCCCTT 


9599 


GGACCCATTT 


ATTATTCACA 


GATGGGGAGA 


ACCTATCTGC 


ATGGACCCTC 


ACCATCCTCT 


9659 


GTGCAGCACA 


CACAGTGCAG 


GGAGCCAGTG 


GCGATGGCGA 


TGACTTTCTT 


CCCCTGGGAA 


9719 


TTCC 












9723 
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